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Glass-forming systems, which are characterized by a highly disordered energy land- 
scape, have been studied in physics by a simulation-based state space aggregation. The 
purpose of this article is to develop a path-independent approach within the framework 
of aperiodic, reversible Markov chains with exponentially small transition probabilities 
which depend on some energy function. This will lead to the definition of certain metas- 
tates, also called metabasins in physics. More precisely, our aggregation procedure will 
provide a sequence of state space partitions such that on an appropriate aggregation 
level certain properties (see Properties 1-4 of the Introduction) are fulfilled. Roughly 
speaking, this will be the case for the finest aggregation such that transitions back to 
an already visited (meta-)state are very unlikely within a moderate time frame. 
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Introduction 

Supercooled liquids of glass forming systems are typical examples of high-dimensional systems with 
highly disordered energy landscapes and our main concern behind this work. Simulations have 
shown that many important characteristics of such a system are better described by a process on 
the set of so-called metabasins (MB) than by the more common process on the set of visited minima 
of the energy landscape (see [12] )• Those MB are formed in the following way by aggregation of 
suitable states of the describing process (X n ) n >o along a simulated trajectory: 

Fixing a reasonable observation time T, define \o = and then recursively for n > 1 
Xn := inf {k > Xn-x I {X k ,...,X T } n {X ,..., X fc _ x } = 0}. 
Then the MB up to v :— sup{rt > | \ n < T} are chosen as 

V n :={X Xri ,...,X Xri+1 ^}, 0<n<v. 

Simulation studies have shown that local sampling within a MB does not affect typical parameters 
of the process like the diffusion coefficient or the time to reach equilibrium. Dynamical aspects 
are therefore fully characterized by the MB-valued process. Furthermore, this model reduction by 
aggregation, as proposed in [T^] and [TB], offers several advantages (referred to as Properties 1-5 
hereafter): 
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1. The probability of a transition from one MB to any other one does not depend on the state 
at which this MB is entered. 

2. There are basically no reciprocating jumps between two MB. This is in strong contrast to 
the unaggregated process where such jumps occur very often: The system falls back to a 
minimum many times before eventually cresting a high energy barrier and then falling into a 
new valley, where it will again take many unsuccessful trials to escape. These reciprocating 
jumps are not only irrelevant for the actual motion on the state space but also complicating 
the estimation of parameters like the diffusion coefficient or the relaxation time. 

3. The expected time spent in a MB is proportional to its depth. Thus there is a strong and 
explicit relation between dynamics and thermodynamics, not in terms of the absolute but 
the relative energy. 

4. The energy barriers between any two MB are approximately of the same height, that is, there 
is an energy threshold E such that, for a small e, it requires a crossing of at least Eq — e 
and at most Eq + e to make a transition from one MB to another. Such systems with e = 
are called trap models (see [3]). 

5. The sojourn times and the jump distances between successively visited MB (measured in 
Euclidean distance) form sequences of weakly or even uncorrelated random variables, and 
are also mutually independent, at least approximately. Therefore, the aggregated process 
can be well approximated by a continuous time random walk, which in turn simplifies its 
analysis and thus the analysis of the whole process. 

Despite these advantages, the suggested definition of MB has the obvious blemish that it depends 
on the realization of the considered process and may thus vary from simulation to simulation. To 
provide a mathematically stringent definition of a path-independent aggregation of the state space, 
which maintains the above properties and is based on the well-established notion of metastable 
states, is therefore our principal concern here with the main results being Theorem |3.13| and 



Theorem 3.17 In this endeavor, we will draw on some of the ideas developed by Bovier in [I] 
and by SCOPPOLA in [21], most notably her definition of metastable states. 

Metastability, a phenomenon of ongoing interest for complex physical systems described by finite 
Markov processes on very large state spaces, can be defined and dealt with in several ways. It 
has been derived from a renormalization procedure in |20| . by a pathwise approach in [3], and 
via energy landscapes in (3], the latter being also our approach hereafter. To characterize a 
supercooled liquid, i.e. a glass forming system at low temperature, via its energy landscape was 
first done by Goldstein in 1969 [TT] and has by now become a common method. The general task 
when studying metastability, as well originally raised in physics ([13], [IS] or [TB]), is to provide 
mathematical tools for an analysis of the property of thermodynamical systems to evolve in state 
space along a trajectory of unstable or metastable states with very long sojourn times. 

Inspired by simulations of glass forming systems at very low temperatures with the Metropolis 
algorithm, we will study (as in [ST]) finite Markov chains with exponentially small transition 
probabilities which are determined by an energy function and a parameter /3 > 0. This parameter 
can be understood as the inverse temperature and we are thus interested in the behavior of the 
process as j3 — > oo. We envisage an energy function of highly complex order and without the 
hierarchical ordering that is typical in spin glass models. A good picture is provided by randomly 
chosen energies with correlations between neighbors or by an energy landscape that looks like a real 
mountain landscape. We will show that, towards an aggregation outlined above, the metastable 
states as defined in |21| are quite appropriate because they have an ordering from a kind of "weak" 
to a kind of "strong" metastability. Around those states we will define and then study connected 
valleys [Definition |1.5| characterized by minimal energy barriers. In the limit of low temperatures, 
any such barrier will determine the speed, respectively probability of a transition between the two 
valleys it separates. More precisely, in the limit f3 — > oo, the process, when starting in a state x, 
will almost surely reach a state with lower barrier earlier than a state with higher barrier [Theorem 



2.1 . In the limit of low temperatures, the bottom (minimum) of an entered valley will therefore 
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almost surely be reached before that valley is left again [Proposition 3.4 . As a consequence, the 
probability for a transition from one valley to another is asymptotically independent of the state 
where the valley is entered. This is Property 1 above. 

Furthermore, since valleys as well as metastable states have a hierarchical ordering, we can build 
valleys of higher order by a successive merger of valleys of lower order [Proposition 1.12] . Given 
an appropriate energy landscape, this procedure can annihilate (on the macroscopic scale) the 
accumulation of reciprocating jumps by merging valleys exhibiting such jumps into a single valley 
[Subsection 3.2 . Hence, valleys of sufficiently high order will have Property 2. 

Beside the macroscopic process [Section 3], which describes the transitions between valleys, one 
can also analyze the microscopic process [Section [2], that is, the system behavior when moving 
within a fixed valley. Here we will give a formula for the exit time and connect it with its parameters 
[Theorem 2.12 . This will confirm Property 3. 



Having thus established Properties 1-4 [Theorem 3.13 , we will finally proceed to a comparison 
of our path-independent definition of MB with the path-dependent one given above. It will be 



shown [Theorem 3.17 that both coincide with high probability under some reasonable conditions 
on the connectivity of valleys which, in essence, ensure the existence of reasonable path-dependent 



MB. We will also briefly touch on the phenomenon of quasi-stationarity [Proposition 2.9 which 
is a large area [T7] but to our best knowledge less studied in connection with the aggregation of 
states of large physical systems driven by energy landscapes. 

Let us mention two further publications which, despite having a different thrust, provide defi- 
nitions of valleys, called basins of attraction or metastates there, to deal with related questions. 
Olivieri & SCOPPOLA [16] fully describe the tube of exit from a domain in terms of which basins of 
attraction of increasing order are visited during a stay in that domain and for how long these basins 
are visited. In a very recent publication, Beltran & Landim [2], by working with transition rates 
instead of energies, aim at finding a universal depth (and time scale) for all metastates. However, 
we rather aim at the finest aggregation such that transitions back to an already visited metastate 
are very unlikely within a time frame used in simulations. This finest aggregation will lead to 
valleys of very variable depth just as simulations do not exhibit a universal depth or timescale. 



1 Valleys 



Let X be a Markov chain on a finite set S with transition matrix P = (p(r, s)) r , s es an d stationary 
distribution 7T, and let E : S — > M. be an energy function such that the following conditions hold: 

Irreducibility: P is irreducible with p(s, s) > and p(r, s) > iff p(s, r) > for all r,seS. 

Transition Probabilities: There exist parameters f3 > and 7,3 > with 7^ 0,/3jp — s- 00 
as j3 — > 00 such that 

e -p((E(s)-E(r)) + + 7fl /\S\) < p ( r;S ) < e -P((E(s)-E(r)) + -- f0 /\S\) 

for all distinct r, s £ S with p{r, s) > 0. Furthermore, 

p*(r,s) :— lim p(r,s) 

exists for all r,s £ S, is positive if E(r) > E(s) and = otherwise. 
Reversibility: The pair (n,P) satisfies the detailed balance equations, i.e. 

n(r)p(r, s) = 7r(s)f>(s, r) 

for all r,s £ S. 



Non-Degeneracy: E(r) ^ E{s) for all r,se<S,r/ s. 



4 



1 VALLEYS 



We are thus dealing with a reversible Markov chain with exponentially small transition proba- 
bilities driven by an energy landscape. As an example, which is also the main motivation behind 
this work, one can think of a Metropolis chain with transition probabilities of the form 

P{r ' S) ~C(r) 

Here /3 is the inverse temperature and C(r),r <E S, is a parameter, independent from /3, giving 
the number of neighbors of r. For 7^/|<S| := max re s ln(C(r))(/3 + l)" 1 / 2 the above conditions are 
fulfilled. Let us start with the following basic result for the ratios of the stationary distribution. 

Lemma 1.1. For any two states r,s G5 with E(r) > E(s), we have 

e -/3(E(r)-E( a )+2 7 p) < TTfr) < e _/3(E( r )-E( s )-2 7j9 ) _ 
- 7r(s) ~ 

Proof: To start with, assume r ~ s. Reversibility and the assumptions on the transition probabil- 
ities imply 

wM ntz r\ P -0((E(r)-E(s)) + -~f /\S\) 

p{S,r) e " -fS(E(r)-E{s)~2 lel /\S\) 

7r(s) p(r,s) ~ e -^((E(/p)-B(r))++7fi/|S|) 



and 



7r(r) p(s,r) e ~P((E(r)-E(s))++ lel /\s\) 



> f = e -P{E{r)-E( t s)+2yp/\S\) 



?r(s) " p(r,s) ~ e-«(E(«)-BW)+-7?/|S|) 

Now let r and s be arbitrary. By the irreducibility, there is a path r = ro,ri,...,r n — s from r to s of 
neighboring states with 7r(r l )/7r(r 4+1 ) G [ e -^(E(n)-s(r i+1 )+2 7 ^/|S|) ) e -0(s(r i )-B(r i+1 )-2 7? /|S|)] 5 g < 

i < n — 1. This finishes the proof. □ 

Under the stated assumptions, SCOPPOLA [2T] has shown the existence of a successive filtration 
(aggregation) S = M (0 ) D M (1 ) D ... D A/( n) = {s }, n £ N, of the state space such that the 
elements of each set M™, 1 < i < n, can be called metastable in the following sense: 

• They arise from the local minima of the energy function or certain modifications of it. 

• There is a lower bound on the expected time needed for a transition from mi to mi for any 
mi, m,2 £ M^' which increases very fast with i. 

• There exists a constant C such that 

F m (X t i M(' i+1 >) < e- Cfj 
for all m £ M"W , < i < n — 1, and sufficiently large t. 
This filtration starts with 

M (1) := {s £ S\E(s) < E(r) for all r - s}, 

and deletes one local minimum at each step. In fact, the local minimum with minimal activation 
energy for a transition to another minimum is deleted, see [21 and [20 for further details. 

Example 1.2. For the simple energy function depicted in Figure [l] a successive application of the 
algorithm from [21] as illustrated in Figure [2] leads to the following decomposition into subsets of 
metastable states: 

M (1) = {2, 4, 6, 8, 10, 12, 14}, M (2) = {2, 4, 6, 10, 12, 14}, 

M (3) = {2, 4, 6, 10, 14}, M (4) = {2, 4, 10, 14}, 

Af (5) = {4, 10,14}, M (6) = {4,14}, 
= {4}. 
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Figure 1: Example of an energy landscape with minima shown as black dots (•) 

Based on the filtration of S just described, we now proceed to a definition of a sequence of 
metastable sets associated with the metastable states which will induce the MB. In order to do so, 
we must study first minimal paths between two states and maximal energies along such paths. 

Definition 1.3. (a) For any two distinct states r,s £ S, let 

r(r, s) := {(x ,...,x k )\ k e N, x = r, x k = s, x { ^ Xj, p(xi,x i+1 ) > for < i < k - 1, i ^ j} 

be the set of all finite self-avoiding paths from r to s having positive probability. For any 
such path 7 = (7 ,...,7fc) € T(r,s), let I7I := k be its length. We further write t <G 7 if 
* € {7iv,7fc}- 

(b) A self-avoiding path 7 = (7i,...,7fe) from r to s is called minimal if its maximal energy 
maxKKt E(-fi) is minimal among all 7' e T(r, s). The set of these paths is denoted T* (r, s). 

(c) The essential saddle z* (r, s) between r and s is then defined as 



z*(r,s) :— argmaxi?(t) G S 



for any 7 e T*(r, s). 



As for (c), it is to be noted that, due to the assumed non-degeneracy of the energy function, 
the essential saddle is unique, which means that it does not depend on (as it must) which minimal 
path we choose in the definition of z*(r,s). There may indeed be several minimal paths, every 
single one thus crossing the saddle at some time. With the help of these notions the valleys can 
now be defined in a quite concrete way. Let us label the local minima as m^,..., m^ n \ so that 
AfW = {mW,...,m( n )} for each i = l,...,n. 

Definition 1.4. For each m € AfW, 1 < i < n, let 

V^\m) :={seS E(z*(s,m)) < E(z*(s,m')) for all m' e M«\{m}} . 
We say that state s is attracted by m at level i, expressed as s m at level i, if 

E(z*(s,m))= min E{z*{s,n)) 

and every minimal path from s to a state m! G M^ l '\{m} with E(z*(s, m')) = E(z* (s,m j) hits 
V<\m) at some time. Finally, let 

l(i) := inf {i < j < n\m^ -^mat level j for some m £ 

denote the minimal level at which the minimal state becomes attracted by a minimal state of 
superior level. 
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• metastable states M w . 1 < i < 7 
A minimal hill 
□ valley 



Figure 2: Successive application of the algorithm in [5T] to the energy landscape in Figure [T] For 
each step i, the metastable states as well as the corresponding valleys are shown. 



Definition 1.5. (a) Initialization: For each m € AfW, define 

V (1) (m) := js G S s m at level I 
as the valley of order 1 containing m and let 



iV« : = (J vW(m 



be the set of non-assigned states at level 1. 

(b) Recursion: For 2 < i < n and m € define 



V 



(i) 



(m) := V (<-1) (m) u{se iV (j 



s^+mat level i > U 



u 



7(i)( TO (i)) 



j:l{j)—i^m^) ~~>m at level 2 



as the valley of order i containing m and let 



U =1 



be the set of non-assigned states at level i. 
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Here is a more intuitive description of what the previous two definitions render in a formal way: 
First, we define, for each level i and m 6 M^, the set V< (m) of those states s that are strongly 
attracted by to in the sense that E(z* (s,m)) is strictly smaller than E(z* (s,m')) for any other 
m' 6 AfW. Then, starting at level one, each valley V^(m), m 6 M^, is formed from V< (m) 
by adjoining all further states s attracted by to at this level. This leaves us with a set of non- 
assigned states, denoted A^ 1 ). In the next step (level 2), any V^(m) for to e M^> is obtained 
by adjoining to V^(m) all those s e A^ 1 ) which are attracted by to at level 2. Observe that 
this ensures V^\m) C V( 2 )(m). Moreover, if to' 1 - 1 is attracted by to at level 2, then V^(m^) 
is merged into y( 2 )(TO.) as well. If no such to exists (thus 1(1) > 2), it remains untouched until 
reaching level Z(l) where its bottom state mP^ becomes attracted by some to' € causing 
its valley to be merged into V^^'im'). This procedure continues in the now obvious recursive 
manner until at level n all states have been merged into one valley. Obviously, valleys of the same 
order are pairwise disjoint. Also, valleys once formed at some level can only be merged as a whole 
and will thus never be ripped apart during the recursive construction. For the energy function 
depicted in Figure [l] the successively derived valleys of order i — 1,..., 7 are shown in Figure [2] 

Before proceeding to results on the general shape of valleys, we collect some basic, mostly 
technical properties of essential saddles which will be useful thereafter. 

Proposition 1.6. For any r,s,u e S, < i < n, toi,to,2 € M^',mi ^ to,2, and Xi,x 2 € S with 
X\ 6 (mi) and X2 € V^(m 2 ), we have 

(a) z*(r.s) — z*(s,r). 

(b) E(z*(r,s)) < E(z*(r,u))V E(z*(u,s)). 

(c) E(z*(x 2 ,m 2 )) < E(z*(x 2 ,m')) for all m! E M«. 

(d) E(z*(xi,m 2 )) = E(z*(m 1 ,m 2 )). 

(e) E(z*(xi,x 2 )) > E(z*(m 1 ,m 2 )). 

(f) z*(xi,x 2 ) ^ x x . 

Proof: Parts (a) and (b) are obvious. 

For (c) we use an induction over i and note that there is nothing to show when i = 1. For general 
i, we must only verify that E(z*(x 2l m 2 )) < E(z*(x 2 ,m')) for all to' € if x 2 <E V^(m^) for 
some j < i such that l(j) — i and ~» to at level i (due to the recursive definition of V^ l \m)). 
But the latter ensures that E(z*(x 2 ,m ( - j) )) < E(z*(x 2 ,n)) for all n e D AfW (inductive 

hypothesis) as well as E(z* (m^\m 2 )) < E(z* (m^\m')) for all to' € Consequently, for any 

such to', 

E(z*(x 2 ,m 2 )) < E(z* {x 2 ,m {3) ))\/ E{z* {m {j \m 2 )) 

< E(z*(x 2 ,m {j) ))V E(z*{m (j \m')) 

< E(z*{x 2 ,m {j) )) V E(z*(x 2 ,m'))) 
= E(z*(x 2 ,m')) 

as asserted. 

For assertion (d), note that E(z* (x\ , vn 2 )) > £'(z*(x 1 ,to 1 )), which in combination with (a) and 
(b) implies 

E(z*(m,i,m 2 )) < E(z*(mi,xi)) V E(z*(xi,m 2 )) = E(z* (xi,m 2 )) 
and then further 

E(z*(x 1 ,m 2 )) < E{z*{ Xl , mi ))V E(z*( mi ,m 2 )) < E(z*(x 1 ,m 2 )). 

s v ' s V ' 

<E(z*(x 1 ,m 2 )) <E(z'(x 1 ,m 2 )) 
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So the above must be an identity, i.e. E(z*(xi, m 2 j) = E(z*(rrii, m 2 )). 
Turning to part (e), we first infer with the help of (c) and (d) that 

E(z*(xi,mi)) < E(z*(xi,m 2 )) 
= £(z*(mi,m 2 )) 

<E(z*( mi ,x 2 ))V E(z*(x 2 ,m 2 )) (1) 
= E(z*(x 2 ,m 1 )) 

< E(z*(x 2 , Xl )) V E(z*{x 1 ,m 1 )), 

thus 

E{z*(xi,m{)) < E(z*(x u x 2 )). (2) 

Together with the just shown inequality E(z*(mi,m 2 )) < E(z* (x 2 , mi)) (see ([!])) and another use 
of (c), this yields 

E(z*(x 1 ,x 2 )) = E{z*{x 2 ,x x )) V E(z*(x 1 ,m 1 )) > E(z*{x 2 ,m 1 )) > E(z* (m 1: m 2 )). 

Finally, we infer with the help of Q that 

E(z*(x 1 ,x 2 )) > E{z*(xi,mx)) > E( Xl ) 

and thus z*(xi,x 2 ) ^ x\ as claimed in (f). □ 

Remark 1.7. It is useful to point out the following consequence of the previous proposition. If, 
for an arbitrary state s and any two distinct metastable states m,n G there exists a minimal 

path 7 from s to n that hits a state r with E(z*(r,m)) < E(z* (r,n), then there is also a minimal 
path from s to n that passes through m. Namely, if we replace the segment from r to n of the 
former path by the concatenation of two minimal paths from r to m and from m to n, then the 
maximal energy of this new path is 

E(z*(s,n)) V E(z*(r,m)) V E(z*(m,n)) < E(z*(s,n))V E(z*(r,m))V E(z*(r,n)) 

= E(z*(s,n))V E(z*(r, n)) 
= E(z*(s,n)), 

by Proposition |1.6[ b), whence the new path has to be minimal from s to n as well. This yields 
two facts: 

(a) A minimal path from s to n, where s n at level i, hits V< (n) before it hits any r with 
E(z*(r, m)) < E(z*(r : n)) for some m G M^. Otherwise, since the subpath from r to m can 
be chosen to stay in {t\E(z* (t,m)) < E(z*(t,n))} and thus E(z*(s,m)) — E(z*(s,n)), there 
would be a path from s to m not hitting V< (n.). 

(b) If s ~+ n at level z and m € AfW\{n} with E(z*(s,n)) = E(z*(s,m)), then a minimal path 
from s to m does not only hit V< \n) at some time, but in fact earlier than any other valley 
V£\m'),m' G M^\{n}. 

Lemma 1.8. Let l<i<j<n, m = mfi and s G V^(m). Then s G F<^(m') /or some 
ml € AfW implies < j, m G V< (m') and £/i«s V"W(m) C F^(m'). 

In other words, whenever (m^) contains an element s which at some higher level j belongs to 
some V^ j \m'), ml G AfW, the same must hold true for itself implying VW(m^) C V^(m'). 
Conversely, this guarantees that V^(mW) will have no common elements with any V<\m') at 
levels j < l(i) where it has not yet been merged into a valley of higher order. 
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Proof: Let us first note that, under the given assumptions, 

E(z*{s,m)) < E(z*(s,m')) < E(z*{s,nj) 

for all n G M^>\{m'}, whence 

E(z*(m,n)) < E(z*(s,m))V E(z*(s,n)) = E(z*(s,n)) < E(z*(s,m)) V E(z*(m,n)) 

entails E(z*(m, n)) — E(z*(s, n)) for all such n. Using this fact, we find that 

E(z*(m,m')) < E(z*(s,m)) V E(z*{s,m')) < E(z*(s,n)) = E(z*(m,n)) 

for all n G M^\{m'}, which implies to to' at level j and thus l{i) < j as well as the other 
assertions. □ 

Proposition 1.9. For every to € and 1 < i < n, V<\m) is connected. 

Proof: Pick any s G V<\m), any minimal path from s to m and finally any intermediate state r 
along this path for which r G V< (m) must be verified. For every m! G AfW\{m}, we find 

E{z*{r,m)) < E(z*(r,s))V E(z*(s,m)) 
= E(z* (s, to)) 

< E[z*(s,m')) 

< E(z*(s,r)) VE(z*(r,m')) 

<E(z*(s,m')) 

= E{z*{r,m')), 

which shows r G V< (m) as required. □ 

Note that we have even shown that a minimal path from a state in (m) to to will never leave 
this set. We may expect and will indeed show as Proposition 1 1 . 1 1 1 below that V^\m) is connected 
as well. The following lemma is needed for its proof. 

Lemma 1.10. Given 1 < i < n, to G M^' and s to at level i, let 7 = (71,..., 7&) G T* (s, to) be a 
path such that E(z* (7^, to)) < E(z* (7^, n)) for all n G M^\{m}, and which stays in (to) once 



hitting this set (such a 7 exists by Remark 1.7 (a)). Then jj to at level i for each j — l,...,k. 



Proof: There is nothing to prove for 71 = s and any jj G V< \m). So let r be any other state 
visited by 7, pick an arbitrary n G M«\{tii} with E(z*(r, n)) = E(z*(r, to)) and then any minimal 
path r from r to n. Let a be the subpath of 7 from s to r. We must show that r hits V< (m). 
First, we point out that the maximal energy E(z*(s,r)) V E(z* (r,n)) of <jt, the concatenation of 
a and t, satisfies 

E(z*(s,n)) < E(z*(s,r))V E(z*(r,n)) < E(z* (s,m)) V E(z* (r,m)) = E(z*(s,m)) < E(z*(s,n)), 
implying err G r*(s,n) and, furthermore, 

E(z*(s,r)) VE(z*(r,n)) = E(z*(s,m)) VE(z*(r,n)) = E(z*(s,m)) V E(z*(r,m)) = E(z*(s,m)). 

Thus err must hit V^^to). But since a does not hit V^^m) by assumption, we conclude that t 
must hit V<\m). Since r G T*(r,n) was arbitrary, we infer r to at level i. □ 

The next two propositions provide information on the shape of the valleys and their nested 
structure. 

Proposition 1.11. For every to G M^ % ' and 1 < i < n, V^(m) is connected. 
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Proof: We use an inductive argument. If i = 1, the assertion follows directly from the definition 
of the level-one valleys because any s G V^(m), to G may be connected to to by a minimal 

path that eventually enters V< (to) without hitting any other V< (n) and is therefore completely 
contained in V^(m) by the previous lemma. 

Turning to the inductive step, suppose the assertion holds true up to level i — Fix any to G MW 
and notice that, by the inductive hypothesis, V( t-1 )(m) as well as all V^'{rrv^') with — i and 
m 0) rn at level i are connected. Now, since these as well as all s G ./V^ 1 ) attracted by to 
at level i may be connected to m by minimal paths as assumed in Lemma |1.10[ we conclude that 
(to) is also connected. □ 

The second proposition shows the nested structure of our construction of valleys. 

Proposition 1.12. The following inclusions hold true: 

(a) V {1 \m) C ... C VW(m) /or eacft to g M^, 1 < i < n. 

(b) V® (to) C VW(n) /or eac/i 1 < i < j < n, n G M^') and to G M"W n V^(n). 

Proof: Since there is nothing to show for (a) we move directly to (b). But if m G n V^\n), 
then the definition of valleys ensures the existence of 1 < k < j — i and of n\,..., rik-i G M^\M^' 
such that 7i p _i n p at level i p for each p — 1,..., fc and levels i < l\ < ... < Ik = j, where uq := to 
and nk :— n. As a consequence, 

F W (to) C ^ (Zl) (nx) C ... C V {lk - l] {nk^) C T/(j)(n) 

which proves the asserted inclusion. □ 

To finish the analysis of the shape of the valleys we show that they have the following important 
property: a special class of minimal paths from the inside of any V^'irn) to the outside of it must 
hit its interior V< (to) . But in order to show this we must first verify that all states attracted by 
m at level i belong to V^'(m). 

Lemma 1.13. For each 1 < i < n and m G M^ l \ we have that 



s G S 



S TO 



at level iX C V {t) {m) C js G S E(z*(s,m)) < E(z*(s,m')) for all to' G M (i) j 



Proof: For the second inclusion it suffices to refer to Proposition |l ,6[ c) . The first inclusion being 
obviously true for s G A^ l_1 ), we turn directly to the case when 

s ni at level li, ni n 2 at level l 2 , ... n)~—i ~^ at level 

with fc > 1 and 1 < lx < ... < Ik < i— 1. Here, ni denotes the first minimum to which s is attracted 
(thus s G V^ ll \nxj). while is the last minimum of this kind in the sequence. We may assume 
without loss of generality that rij ^ to for all j, for otherwise the assertion is clear. 

We show now that ni to at level i which in turn implies nj to at level i for all 1 < j < fc. 
As a consequence, njt G V^'(m) and thus s G V^\m). If £ , (z*(tii,to)) < £J(^*(rii, m')) for all 
m' G M"\{m}, the assertion is proved. Hence suppose -E(z*(ni, m)) > i?(z*(ni, to')) for some 
m' G AfW\{m}. Then 

E(z*(s,rri)) < E(z*(s, ni )) V E(z*(n u m')) 

< E{z*{s,m))V E(z*(m,m)) 

< E{z*{s,m))V E(z*{s,m)) 
= E(z*{s,m)) 
<E(z*( S> m')), 

implies E(z*(s,m)) = E(z*(s,m')) and also that the concatenation of any minimal path 7 from s 
to n\ and any minimal path r from n\ to to' (with maximal energy E(z*(s, Tlx)) V E{z*{n\, to'))) 
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constitutes a minimal path from s to ml and must therefore hit V<? (to) . Note that we can 
choose 7 to stay in V^ ll '(ni) since s G V^ ll '(ni) and V^(ni) is connected. Now, if r hits 
V<\m), then E(z*(nx,m)) = E(z*(ni, to' )) a nd we are done. Otherwise, 7 hits V< (to) implying 



V^(ni) n v£ J (m) 7^ 0. Now use Lemma 1.8 to conclude n% € V< (to) and therefore m to at 



level i. This completes the argument for the first inclusion. □ 
We provide too further lemmata that will be needed later on. 

Lemma 1.14. Let m € M^\ x ^ m at level i and y £ V^'{m). Then either every minimal path 
from x to y hits the set V<\m), or E(z*(x,y)) > E(z*(x,m)). 

Proof: Suppose there is a minimal path 7 from x to y avoiding V<\m). Since y £ KW(to), it is not 
attracted by m at level i implying the existence of some to' 6 with E(z*(y, to')) < E(z*(y, to)) 
and of some r G T*(y,m') avoiding V< (m). Hence, the concatenation 7T avoids V<^(to) and must 
therefore have maximal energy larger than E(z*(x,m)). Consequently, 

E(z*(x,m)) < E(z*(x,y)) V E(z*(y,m)) 
<E(z*(x,y))W E(z*(y,m)) 
< E(z*(x,y))VE(z*(x,m)), 

and thus E(z*(x,y)) > E(z*(x, to)). □ 

In order to state the second lemma, let us define the outer part d + V of a valley V to be the set 
of those states outside V which are adjacent to a state in V. With the help of the previous result, 
we can easily show that d + V contains only non-assigned states at any level where V has not yet 
been merged into a larger valley. 

Lemma 1.15. For any 1 < i, j < n and m = with l(i) > j, the outer part d + V of the valley 
V := V < " jAl '(m) is a subset of N^' and E{z* (s,to)) = E[s) for every s € d + V. 

Proof: First, let s £ d + V and suppose that s £ NU). Then s ~~> to' at level k, in particular 
s € V^(m') for some m' € and k < j . Pick any r 6 V with r ~ s and note that 



r 6 tJ+V^'fm'). W.l.o.g. we may assume that r to at level j Ai. Then Lemma 1.14 (with x = r 
and y = s) ensures that either E(z*(r,s)) > E(z*(r, to) > £7(r), thus z*(r,s) = s and £7(r) < -E(s), 
or r G V<(to) and, for some n G M^ M \ 

E(z*(r,m)) < E(z*(r,n)) 

< E(z*(r,s))y E(z*(s,n)) 

<E(z*(r,s))V E{z*{s,m)) (3) 

< E(z*(r,s)) V E(z*(r, to)) 
= B(s)V^*(r, to)), 

and thus again E(r) < E(s). On the other hand, by the very the same lemma (now with x = s and 
y = r), we infer E(r) > E(s) which is clearly impossible. Consequently, s must be non-assigned at 
level j as claimed. 

For the second assertion take again s G d + V and a minimal path 7 = (s, r,..., to) from s to 
with r G V. Again, by use of Lemma 1.14[ we find either E(z* (r, to)) < E(z*(r, s)) = E(s) 



in 



or r G V<(m), leading analogously to equation |3| to £(z*(r, m)) < £?(s) V E(z*(r,m)) and thus 
E(z*(r,m)) < E(s). In conclusion, both cases result in 

E(z*(s,m)) = £(s)V E(z*(r,m)) = E(s), 

finishing the proof. □ 
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The reader may wonder why valleys are defined here via essential saddles and not via the at first 
glance more natural overall energy barriers, viz. 

I(s,m) := inf /(71,-, 7m) ( 4 ) 

7gr(s,m) 

with 

n 

I( Sl ,...,s n ) :=Y,(E(s l )-E( S ^ 1 )) + 

i=l 

for a state s in a valley and the pertinent minimum m. This latter quantity, also called activation 
energy, is indeed an important parameter in |21| . The reason for our definition becomes clear when 
regarding the last proposition which shows the nested structure of valleys of increasing order and 
which may fail to hold when choosing an alternative definition based on the activation energy. 
Valleys that are formed in one step may then be ripped apart in the next one. This happens, for 
instance, if there is just one large saddle along the path to a metastable state and several small 
ones, lower than the essential saddle, along the paths to another minimum such that their total 
sum is larger than the big saddle. In further support of our approach, it will be seen later that the 
essential saddles are the critical parameters for the behavior of the aggregated chain (see Theorem 
2T| . 

In a nutshell, by going from (V^ (m)) meRI (i) to {V^ l+1 \ra)) m( z M {i+i) , some valleys are merged 
into one (with only the smaller minima retained as metastable states) and additionally those states 
from are added which at level i were attracted by metastable states now all belonging to the 
same valley This induces the following tree-structure on the state space: 

• Fix = sq. 

• The first generation of the tree consists of all m <E Af( n ~^ U A/^" -1 ) and are thus connected 
to the root. 

• The second generation of the tree consists of all m e Af( n ~ 2 ' U N( n ~ 2 \ and m is connected 
to the node k of the first generation for which E(z*(m 1 k)) is minimal or to itself (in the 
obvious sense). 

• This continues until in the n th generation each state is listed and connected either with its 
unique point of attraction in the previous generation or with itself. 

Example 1.16. For the energy function of Example |1 ,2| and depicted in Figure [T] the described 
tree is shown in Figure [3j The sets of non-assigned states at the different levels are 




Figure 3: The tree belonging to Figure [T] 

7V (1) = {3, 5, 7, 9, 11, 13}, iV (2) = {3, 5, 7, 11, 13}, 

N {3) = {3,5,7, 11}, iV (4) = {3,7, 11}, 

7V (5) = {7,11}, AT (6) ={11}, 
= 0. 
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At each level i of such a tree the subtree rooted at a node m £ M^ 1 ' consists of the states in the 
valley V^(m). A similar graph-theoretical modeling in order to visualize high dimensional energy 
landscapes has been used, for example, by Okushima et al. in [15) . These authors work with 
saddles of paths as well. In contrast to our approach, every possible path, that is every possible 
saddle is represented as a node in the tree. But as we will see, in the limit of low temperatures 
(/3 — > oo) the essential saddle is all we need. 

Now there are two fundamental directions for further investigations: 

(1) MICROSCOPIC View: What happens while the process visits a fixed valley V (of arbitrary 
level)? 

In Section [2] we will show that during each visit of a valley V its minimum will be reached 
with probability tending to 1 as (3 — > oo, and we also calculate the expected residence time in 
V, establish Property 3 stated in the Introduction and comment briefly on quasi-stationarity. 

(2) Macroscopic View: How does the process jump between the valleys? 

In Section [3j by drawing on the results of Section [2] we will show that an appropriate 
aggregated chain is Markovian in the limit as j3 — > oo and calculate its transition probabilities. 
With this we will finally be able to provide the definition of MB and establish Properties 1, 
2 and 4 listed in the Introduction. 



2 Microscopic View: Fixing a Valley 

Based on the provided definition of valleys of different orders, we are now going to study the process 
when moving in a fixed valley. 



2.1 Trajectories for (3 -> oo 

The first goal in our study of the microscopic process and also the basic result for the subsequent 
analysis of the macroscopic process deals with the probabilities of reaching certain states earlier 
than others. From this we will conclude that in the limit f3 — >■ oo (which is the low temperature 
limit in the Metropolis Algorithm) the process, when starting somewhere in a valley, will visit its 
minimum before leaving it. 

For A C S and x € S, we define 

t~A '■= inf{n > 1|A„ 6 A}, t x := t {x} and JV(x) := {y 6 S\p(x 7 y) > 0}. 

Two states x,y with p(x,y) > are called neighbors (x ~ y) and N(x) the neighborhood of x. 
Hence, S may (and will) be viewed as a graph hereafter with edge set {(x,y)\x ~ y}. Given any 
subgraph A, we will write P for the transition matrix of the chain restricted to A (p(r, s) — p(r, s) 

for all distinct r, s € A, p(r, r) = 1 — X) s gA s ^rP( r ' s )) anc ^ ^ f° r probabilities when regarding this 
restricted chain starting at x G A. 



2.1.1 Hitting probabilities 

Theorem 2.1. Let x,y,z € S be any pairwise distinct states satisfying E(z* (x, z)) > E(z*(x,y)) 
and z*(x,z) ^ x. Then there exist nonnegative constants K(f3) satisfying sup« >0 K(fi) < oo (and 
given more explicitly in Proposition 2.4 below) such that 

Px(T* < T y ) < K((3) e -PiE(z*(x,z))-E{z»(x,y))-7 lfl ) = . ^ ^ ^ p) Q 

Thus in the limit of low temperatures (/3 — > oo), only the smallest of all possible energy barriers 
affects the speed of a transition. In particular, we have the following result which is preliminary 
to the subsequent one. 
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Theorem 2.2. Given distinct x,y € S and m £ M^' such that x m at level i and y ^ V^(m), 
let B := {z G <S|-E(z* (x, z)) > E(z*(x, m))}. Then it holds true that 



c ( ^ 7"ni ) 

/ \ 



< e(x,m,2/,/3)l B (y) + 



\z:E(z)>E(z*(x,y)) 2SV^°(m) / 



e(x,m,y,p) ^-^T 0. 



Theorem 2.3. Groen m € AfW, a; € F w (m) and y ^ V^^(m), let k < i be such that 

too := £ mi a£ level l\ 1 m\ TO2 af ZeweZ ?2j •■• Wfc-i wife — m at level Ik 

for suitable 1 < h < ■■■ < h < i, mj G M^' /or j = l,...,fc determined by the construction in 
Definition \1.5[ Then 

k k 

Px{T y <T m ) < y^Pm.j-iC'TV < Tm.,) < ^ i(mj-i, TUj, y, /3) — + 0. 

For the valleys as defined here this confirms Property 1 stated in the Introduction: If ft is 
sufficiently large, then with high probability the minimum of a valley is visited before this valley 
is left. 

All three theorems are proved at end of the next subsection after a number of auxiliary results. 
2.1.2 Auxiliary results and proofs 



The proof of Theorem |2 . 1 1 will be accomplished by a combination of two propositions due to Bovier 
et al. [5] for a more special situation. We proceed with a reformulation of the first one in a weaker 
form and under weaker assumptions. 

Proposition 2.4 (compare Theorem 1.8 in |5j). Let x,y,z G S be pairwise distinct such that 
z* (x, z) ^ x. Then 

V x {t z < t x ) < K{fi) l^r 1 e- p{E{z ' (x ' z)) - E{x) - 2 ~<t\ 

Px(T„ < T X ) > | 5 |-l e -f(S(**(*.»))-«(*)+67/.) j 

where K(j3) := \S\ max re5 \Af(r)\ (|S| e -^ min ^^(°>>Ew( £ ( Q )- B ( b ))- 2 ^) + l). 

The proof requires several lemmata, the first of which may already be found in |14[ Theorem 
6.1] and is stated here in the notation of [5]. 

Lemma 2.5 (see Theorem 2.1 in (5]). Defining 

n x z :={h:S^ [0, 1] | h(x) = 0, h(z) = 1} 

and the Dirichlet form 

8(h) := ]T 7T(r)p(r,s)(h(r)-h( S )) 2 , 

we have 

Px(r z < t x ) = -i— inf £{h). 

Lemma 2.6 (see Lemma 2.2 in [5]). For any subgraph Ac5 containing x,y and corresponding 
transition matrix P, we have 

Vx(Ty < T x ) > ¥ x (T y < T x ). 
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Lemma 2.7 (see Lemma 2.5 in [5]). If A — (ujq,..., u>f.) is an H one- dimensional subgraph of S, then 



Proof: (of Proposition 2.4) In view of Lemma 2.5 we must find an appropriate function h for the 



upper bound. Let us define 

TZ := {s £ <S | 3 7 € r(x, z) : argmax -E(7i) = s}, 

i=l,...,M 

the set of all peak states (with respect to the energy function) along self-avoiding paths from x to 
z. Obviously z*(x,z) £ TZ and, by non-degeneracy, E(z*(x, z)) < E(s) for all s £ 7Z\{z* (x, z)}. 
The set TZ divides S into a set 1Z X containing x (and consisting of those y that can be reached from 
x without hitting 1Z) and a set TZ Z containing z. Now choose 

h(r) := l^uw(r), r G 5. 

Note that only neighboring states contribute to the Dirichlet form £(h) and that, for any two such 
r ~ s, the term (/i(r) — h(s)) 2 is positive iff one of these states is in 1Z and the other one in 1Z X , in 
which case the squared difference equals 1. By invoking Lemma |l.l| we obtain 

P x (t z < t x ) = — V — inf £{h) 
2t:(x) h£H% 



AzHx^l £ ^^L p (r, s) + |AA(z* (a ;,z))| 

< e -P{E{z*{x,z))-E{x)-2 lfs ) 



£ H e -^(mina^: B (a)> B ( i ,)(S(a)- J E(6))-2 7 , J ) + |^( z *( a . j z ))| 

< e -0(E(2*(a;,z))-.E( i E)-27 < O 

X max|AA(r)| (| 5 | e -/3(min a ^ :E(£l)>E(b) (£;(a)-B(b))-2 7/j ) + A 



Lemmata |2.6| and |2.7| will enter in the proof of the lower bound. Consider the chain restricted 
to the one-dimensional subgraph given by a minimal path p = (si,..., s\ p \) from x to y. Then 

P-(r„<r.) > ff; ^ 1 



^ 7r(si) p(s,, Sj_i) 



4*) \5 



> g - (x,2,))-£(x)+2 79 ) ^- e -/3(iJ( z *( a: ,a))-£;( Sl )-(B(s 1 -i)-B(s l )) + -373) 

> -,8(£;(z*(x,j/))-B(x)+579) J_ 

151 • 



This completes the proof of Proposition |2.4| □ 
We proceed to the second proposition needed to prove Theorem |2.1| 
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Proposition 2.8 (see Corollary 1.6 in [5]). Given I C <S and distinct x,z <G S\I, 



\(t z < r/) 



e(TfiJ{*} < T x) 



holds true. 



With the help of Propositions |2.4| and |2.8| the proof of Theorem |2.1| can now be given quite 
easily. 



Proof: (of Theorem 2.1) By first using the previous result and then Proposition 2.4 (with K((3) as 
defined there), we find 



,{r z < T y ) 



-X T z < T{x,y}) 



< 



x(T{z,v} < T x) 

x{j~y ^ T~xj 
< K(B) e -/3(S(^*(^^))-S( 2 *(a : ,y))-77^)^ 

The argument is completed by noting that E(z* (x, z)) > E(z*(x,y)) and K(/3) > converges to 

|5|max|AA(r)| 

res 



as (3 — > oo. 



□ 



Proof: (of Theorem 2.2) If B occurs, the asserted bound follows directly from Theorem 2.1 Pro- 
ceeding to the case when B c occurs, i.e. E(z* (x,y)) < E(z*(x,m)), we first point out that 

P x (T y < r m ) = P x (T y < T m ,E(X n ) > E(z*(x,y)) for some n < r y ) 
+ Vx(T y < r m , E(X n ) < E(z*(x, y)) for all n < T y ) 
=■ Pi+P 2 - 

For all z e 5 with E(z) > E(z*(x, y)), we have z*(x, z) ^ x and E(z*(x, z)) > E(z*(x, y)), for 

E(z*(x,z)) > E(z) > E(z*(x,y)) > E(x). 
Therefore, by an appeal to Theorem |2.1[ 



Pi < Px(t z < T y for some z with E(z) > E(z*(x,y))) 

z:E(z)>E(z*(x,y)) 
z:E(z)>E(z*(x : y)) 

To get an estimate for P2, note that every minimal path from x to y must pass through V< (m) 
(Lemma 1.14 1. With this observation and by another appeal to Theorem 2.1 we infer 

Vx{t v < r m , E(X n ) < E(z*(x, y)) for all n < t v ) < ^ V ^ T v < r ™) 

zevf (ra) 

z£V^\m) 
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having further utilized that (by Proposition 1.6 'f ) and (e)) z*(z,y) ^ z and 



E(z* (z,m)) < E(z*(z,m')) 

<E(z*(z,y))VE(z*(y,m')) 
<E{z*{z,y))VE(z*(y,m)) 
<E(z*(z,y))VE(z*(z,m)) 
= E(z*(z,y)) 

for some m' £ M^\{m} with E(z*(y, m)) > E(z*(y, m')), which must exist since y 



Proof: (of Theorem 2.3) We first note that y £ V^ li \nij) for all 1 < j < k. With m,Q,...,mk as 
stated in the theorem (recall mo = x and nik — m), we obtain 



< 



- mi (^y ^ 7"m) ~t~ Pxvy ^ ^~mi ) 



< Pm 2 (Tj, < r m ) + Pmi ( r y < T m 2 ) + P* (^ < T mi ) 
3=1 

Finally use Theorem |2. 2 1 to infer 

P™ 3 -i( T a < r m.,-) < £(™>j-i>mj,y,P) 
for each j = 1,..., k. □ 

2.2 Qu ASI- STATION ARITY AND EXIT TIME 

Naturally, several other questions concerning the behavior of the process when moving in a fixed 
valley are of interest, and quasi- stationarity may appear as one to come up with first. For a 
given valley V (of any level), a quasi-stationary distribution v = (v(j))jev is characterized by the 
quasi-invariance. viz. 

F„(X n = j\ Ts \v >n)=v{j) for all i eV, (5) 

but also satisfies 

lim ¥^(X n = j\ Ts \ v >n) = u{j) for all jeV (6) 



n— foo 



if /j, is an arbitrary distribution with /j,(V) = 1. The latter property renders uniqueness of v. Since 
S is finite, existence of v follows by an old result due to Darroch & Seneta It is obtained 
as the normalized eigenvector of the Perron-Frobenius eigenvalue A = A(^) of a modification of P. 
This eigenvalue A is also the probability for the chain to stay in V at least one step when started 
with v, thus P 1/ (ryc > 1) = A. As an immediate consequence, one finds that the exit time Tyc 
has a geometric distribution with parameter 1 — A under P„. In the present context, this naturally 
raises the question how the parameter A relates to the transition probabilities or the energies of 
the valley V. A simple probabilistic argument shows the following basic and intuitively obvious 
resul t concerning the eigenvalues associated with the nesting V"W(m) C ... C V^'(m) (Proposition 



1.12) for any 1 < i < n and m £ 



Proposition 2.9. Fixing any 1 < i < n and m G M«, let \<J) := X(V^(m)) for j = 
Then A« < ... < A«. 
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Proof: Write Vj as shorthand for the quasi-stationary distribution on V^\m) and Tj for T S \ V (j)/ m y 
Plainly, Tj < T J+1 

(A«) n = ¥ Vj {T J >n) < V Vj {T J+1 >n) 

Px k (T J+ i >n-k) dP v . 

{T j + 1 >k} 

= F Vj (T j+1 > k) P Mfc (T j+1 >n-k), (7) 

where fik(x) := P„.(X& = x\Tj+i > k) for x € V^ +1 \ Since S is finite and by virtue of |6l), we 
have that fik < ^j+i when choosing k sufficiently large. For any such k, we find that (|7| has 
upper bound 

2K j (T j+1 >k)¥ Uj+1 (T j+1 >n-k) = 2P^.(T J+1 > fc)(A^ +1 ))™- fe . 
Hence, we finally conclude 

X U) < (2F Vj (T j+1 > fc)(A (j ' +1 >)- fe V "a (j ' +1 ) 
and thereby the assertion upon letting n — > oo. □ 

An alternative matrix-analytic proof draws on an old result by Frobenius (9], here cited from 
PI Chapter III, §2, Lemma 2]. 

Lemma 2.10. If A = (a^) and C — (cij) denote two real kx k-matrices such that A is nonnegative 
and irreducible with maximal eigenvalue X* A and \cij \ < aij for all 1 < i,j ' < k, then |A| < for 
all eigenvalues A of C. 



Second proof of Proposition 2.9' For any fixed valley V, collaps all states s V into an absorbing 
state (grave) A which leaves transition probabilities between states in V unchanged. A proper 
rearrangement of states allows us to assume that the new transition matrix has the form 

1 

p Q 

for a | V\ x 1-column vector p ^ and a nonnegative, substochastic and irreducible \V\ x |y|-matrix 
Q. Now, for any 2 < j < i, let A be this matrix Q when V = V^'(m), and D be this matrix when 
V = V( j -V(m). Then, obviously, 

HZ ° 

and A is irreducible and nonnegative with maximal eigenvalue \u>. Defining further 

the largest eigenvalue of C equals the largest eigenvalue of D, thus A^ -1 - 1 . Finally, the desired 
conclusion follows from the previous lemma, since |cy| = < for all i < i,j < k. □ 



Another question is how long a given valley is visited and thus about its exit time. There is 
an extensive literature on exit problems for different kinds of stochastic processes. We mention 
(221 Ch. XI.2] and Ch. 4, §4, Theorem 4.1] as two related to our work. The latter one studies 
perturbed systems on a continuous space. We can discretize their argument to get, with use of 
the main theorem in j2T], a nice result on the time needed to leave a valley V^ l \m) for any fixed 
1 < i < n and m G M™. This result is more explicit than the one in |16l Proposition 4.6]. 
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Definition 2.11. For 1 < i < n, N := we define the following stopping (entrance/exit) 

times: 

4 — T N- 



C« :=mi{k>&\X k eN} 

:=mf{fc>C<?|A fc GiV c }, n>0. 



The entrance times mark the epochs when a new valley is visited, while the exit times 
are the epochs at which a valley is left. The reader should notice that we do not restrict ourselves 
to valleys of order i but include those valleys which up to order i have not yet been absorbed by 
some larger valley. Exit and entrance times never coincide since there is no way to go from one 
valley to another without hitting a non-assigned state - crests are always non-assigned (sec Lemma 



1.15| ). 

In this section, we will focus on ^ for any fixed i, thus writing Co := Co^ hereafter, but later 
for the macroscopic process the other times will be needed as well. 

For each valley V^lrn), m G let us define 

s m = sW : — argmin E(s) — argmin E(z*(rn, s)), (8) 



where the second equality follows from Lemma 1.15 
Theorem 2.12. Let m G M {i) . Then 

lim 4 lnE rCo = E(s m )-E(m) 

for any r G V^(m). 

For the upper bound, we need a result from |21j . which in our notation is: 

Proposition 2.13 (see Main Theorem (iv) in For any 1 < i < n, f3 sufficiently large and 

t > 2*- 1 exp(/?(£(s m(i -i)) - E{m^) + 2i\S\jp)), 

sup P x (r M (i) >t)< cxp(-A/3) 

holds true with a positive constant A, where = S should be recalled. 

This will now be used to show the following result. 

Lemma 2.14. Fix 1 < i < n, m G M^' and r G V^(m). Then, for any f3 sufficiently large and 
t > T exp(f3(E(s m ) ~ E(m) + 2(i + l)\S\-/p)), it holds true that 

P r (Co < (i + l)t) > \- 

Proof: Let us first note that we can always arrange for m being equal to by sufficiently 
decreasing the energy function at any m' G MW\{m} so as to make E(s m ) — E(m) minimal 
among all states in This affects neither the valley fW(m) and its outer boundary nor the 

distribution of £g when starting in m, for this distribution does not depend on the energy landscape 
outside of V^(m) U d + V^(m). When applying the previous proposition, the constant A may 
have changed but is still positive which suffices for our purposes. So let m — m^' hereafter. 

Fix t > 2 l exp(f3(E(s m ) - E(m) + 2(i + 1)\S\^ )) and T := it. Since 

E(s m ) - E(m) > E(s m ^) - E(m&) 
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for every 1 < j < i, we infer 

Pr(r MW <T) > P r (T MW <T,t mW <t) 



> P,.(t M (d < t) inf F x (t mW < (i - l)t) 

> P,.(r M( i) < t) inf P x (T M(2) <t) inf P x {t mW < (i - 2)f) 



> TT inf P x (t mU ) < t) 

- LJ r leMlM) 

> (l-exp(-A/3)) J 
3 

> - 
- 4 

for /3 sufficiently large. Furthermore, for /3 so large that P r (r M <i) < r m ) < 1/4, we find that 

Pr(7"M«) < r ) = Pr(T M (*) = r m < + P r (r M(l) < T,T M (i) < T m ) 

< Pr(r m < T)+P r (r M(s , < 

< Pr(r m < T) + J. 
By combining both estimates, we obtain 

Pr (T m < T) > P r (r MW <T)-i > 

Hence, state to is hit in time T with at least probability 1/2 when starting in r. Since to = nrS l \ 
we further have 

Pm(Co<*) > P m (r M(l+ i, < t) > l-exp(-A/3) > 1 

for /3 sufficiently large. Hence, state s m is hit in time i with at least probability 1/2 when starting 
in to. By combining the estimates, we finally obtain 

Pr(Co < (» + 1)«) > Pr(Co < T + i|r m < T) P r (r m < T) 
> Pr(r m <T)P m (Co <f) 
1 



> 



4' 



which proves our claim. 



□ 



Proof: (of Theorem 2.12) Using the lemma just shown, we infer 

Er(Co) < 



< 



< 



i + l)t^(n + l)P r {n{i + l)t<( < (n + l)(i + l)t) 

n>Q 

i + l)tJ2( n + 1 )( v r (Co > n(* + l)*)-P r (Co > (n + l)(* + l)t)) 

ri>0 

i + l)<^P r (Co > n(* + l)t) 

n>0 

i + l)i V fmaxP^ (Co > (i + 1)*)^) 

* — ' \ / 



n>0 
n>0 

4(t + l)t, 
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where t := 2 l exp((3(E(s m ) — E(m) + 2(i + l^S^p)) + 1. Since 7^ -> 0, we get in the limit 

lim ilnE r Co < E(s m )-E(m) 

for all r € V {i) {m). 

Turning to the lower bound, define a sequence of stopping times, viz. po := and 

p n := inf{fc > p n _i|X fc = m or A fe e d+V} 

for n > 1. Then Z„ := Xp n , n > 0, forms a Markov chain the transition probabilities of which 
when starting in m can be estimated with the help of Proposition |2.4[ namely 



V{Zi € d+V^(m)\Z = m) = ¥ m ( Pl = Co) 

= P m (Co < r m ) 

< if (j3) e ~^ min .es+v -E(z*(m,s))--B(m)-2 7fi ) 

= if(/3)e~ /3(B(s " l) ~ B(m) " 27 ' 9:i 

where K(j3) ->Kc (0, 00) as /3 — > 00. Further defining v := inf{fc > 0\Zk € this implies in 

combination with a geometric trials argument that 

F m (v>n) > (l-Kffle-fiW'^-EW-Wy - . 

As a consequence, 

n>l n>l 

For arbitrary r S V, we now infer 

ErCo = ^rCol{Co<Pl} + ^Col{Co>Pl} 

> E,.(E r (Col { c > Pl }I^Pi = m )) 

> E r l{ (o>pl }E m ( 

> Pr(Co > Pl)K(P)- 1 e /3(^m)-E(m)-2 7e ) 

> iK(/3)- 1 e ' 3 ( £; ( s ™)-- B ( m )- 2 ^) 



for all sufficiently large /3, because lim^oo P r (Co > Pi) = 1 (Theorem 2.3 1. Finally, by taking 
logarithms and letting /? tend to 00, we arrive at the inequality 

lim ilnE r Co > E{s rn )-E{m) 

/3->oo p 

which completes the proof. □ 

In [12], E(s m ) — E(m), m € AfW, is referred to as the depth of the valley V(m). Therefore, 
Property 3 from the Introduction holds true and we can relate thermodynamics of the system 
(energies) to dynamics of the chain (holding times) in a very precise way. Especially, there is no 
universal scale for the times spent in different valleys because in general they differ exponentially. 
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3 Macroscopic View: Transitions between valleys 



With the help of the nested state space decompositions into valleys of different orders and around 
bottom states of different stability, we will now be able to provide an appropriate definition of the 
metabasins (MB) that has been announced and to some extent discussed in the Introduction. We 
will further define and study macroscopic versions of the original process X = (A„)„>o- These are 
obtained by choosing different levels of aggregation in the sense that they keep track only of the 
valleys of a chosen level that are visited by X. The motivation behind this approach is, on the one 
hand, to exhibit strong relations between properties of the energy landscape and the behavior of 



X (as in Theorem 2.121 and, on the other hand, to describe essential features of this process by 
looking at suitable macroscopic scales. 

In the subsequent definition of aggregated versions of A, we will distinguish between two variants: 

• A time-scale preserving aggregation that, for a fixed level and each n, keeps track of the 
valley the original chain visits at time n and thus only blinds its exact location within a 
valley. 

• An accelerated version that, while also keeping track of the visited valleys, further blinds the 
sojourn times within a valley by counting a visit just once. 

Actually, the definition of these aggregations at a chosen level i is a little more complicated 
because their state space, denoted S 1 -- 1 ^ below and the elements of which we call level i metastates, 
also comprises the non-assigned states at level i as well as the minima of those valleys that were 
formed at an earlier level and whose merger is pending at level i because their minima are not 
attracted at this level. 

Definition 3.1. Fix 1< i < n, let 



= {mS j) G MW\l(j) >i}U and 

'yW(mW), ifs 
y«( m (i)), if s 

{s}, if s e iV< 



for some j > i 
for some j < i 



for s g S^K Then define 



Y 



and YS l) 



{x„eyw(«)}) 



n > 0, 



Y 



esco 



n > 0, 



where a = a { 



and 



for n > 1. We call Y 



(i) 



= tr» 



= inf < k > <J n -\ 



(Yn )n>o an d Y^ = (Yn^) n >Q the aggregated chain (AC) and the 



accelerated aggregated chain (A AC) (at level i) associated with X = (X n ) r 



So, starting in an arbitrary valley, the original chain stays there for a time £o = Co ( as defined in 



Definition 2.11 ) before it jumps via some non-assigned states ki,...,ki (staying a geometric time in 
each of these states) to another valley at time £i = £^ . There it stays for £i — £i time units before 

it moves on in a similar manner. By going from X to its aggregation Y * at level i, we regard the 
whole valley V^(s) for s € 5W as one single metastate and therefore give up information about 

the exact location of X within a valley. Y^ is a jump process on <SM with successive sojourn 
times u n+ i — cr n , n > 0, which do not only depend on the valley but also on the states of entrance 
and exit. The AAC then is the embedded chain, viz. 



j>0 



{CTj<n<<Tj + i}: 
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giving the states only at jumps epochs: starting from the minimum of a first valley it moves to 
states k\,...,kx € and then proceeds to the minimum of a second valley, and so on. 

Of course, at small temperatures the time spent in a non-assigned state or in a valley around a 
low order metastable state is very small compared to the time spent in a valley around a metastable 
state of higher order. Thus, such states can be seen as instantaneous and of little importance for 
the evolution of the process. We account for them nonetheless for two reasons. First, in the path- 
dependent definition mentioned in the Introduction and used in Physics, they build small MB of 
great transitional activity of the process and are thus relevant in view of our goal to provide a 
definition of MB that conforms as much as possible to a path-dependent one. Second, a complete 
partitioning of the state space that is an assignment of every s £ S to a metastate via a global 
algorithm fails when merely focusing on {V^ (m),m € M^H because there is neither an obvious 
nor natural way how to assign non-assigned states to them. 

The incoherent scattering function and its associated relaxation time, for X defined by 

S(q, n) := cos (q\X n - X Q \) 

(with | • | being Euclidean distance in phase space) and 

T q (e) := inf{n\S(q, n) < e}, e > 0, 

respectively, may serve as an example which shows the strong relation between the behavior of the 
original process and its macroscopic versions. For more detailed information on the meaning and 
relevance of S(q, n) as a measure of incoherent scattering between the initial state of a glass-forming 
system and its state n time steps onward, we refer to the survey by Heuer [12J. 

Proposition 3.2. For each 1 < i < n there is a constant A(i) such that 

supvJx^Y^) <e- A ^. (9) 

n>Q V ' 

As a consequence, for any given e > 0, the incoherent scattering functions of X and Y differ by 
at most 4e for (3 sufficiently large. 

Proof: Use Lemma |l.l| to infer 



7T(V«(s)) 



z — ' 7T(sj 



* E 

sES('>\N(*> i£l'( i )(s)\{s) 

< max |U (l) (s)| max e -0(E(,x)-E(s)-2 70 ) ^ 
seS(^\N(*~> iey(-)( s )\( s } 



This proves equation p| because E(x) > E(s) for each x € V^{s)\{s},s € S^\N^. Now let j3 
be so large that e~ A W^< e for a given e > and observe that 

E T cos (g|X„-X |) 

= [ _ _ cos(q\X n ~ X Q \) dP„ + / _ _ cos(q\X n -X \) d¥^ 

J{x n =Y^ ) .x =Y{; ) } J{x„#Y| i l) }u{x #y<, l) } 

< f _ cos(q\X n -X \) d¥ n + P„(X n £yf) + P 7r (X ^Yf) 

■/{x„=yi i) ,x =?W} 

= E^os( q \Y^-Y^\) - / cos(g|Fi l) -F^|) dP 



2c 



-^(qY^ -Yf 
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and, by a similar argument, 

E^cos^F^-F^l) < E w cos(q\X n -X \) + 4e. 
This completes the proof. □ 

3.1 (Semi-) Markov Property 

In general, both aggregated chains are no longer Markovian. Transition probabilities of the AAC 
not only depend on the current state, i.e. the current valley, but also on the entrance state into 
that valley, whereas transition probabilities of the AC depend on the current sojourn times which 
in turn depend on the previous, the present and the next state. On the other hand, since valleys 
are defined in such a way that asymptotically almost surely (a.a.s.), i.e., with probability tending 
to one as f3 — > oo, the minimum will be reached from anywhere inside the valley before the valley 
is left, and since, furthermore, the exit state on the outer boundary a.a.s. equals the one with the 
smallest energy, the AAC will be shown below to converge to a certain Markov chain on Sw. Also, 
the sojourn times depend on the past only via the last and the current state. This means that the 
AC converges to a semi-Markov chain (for semi-Markov chains see for example [T]): 

Definition 3.3. Given any nonempty countable set 5, let (M n ,T„)„>o be a bivariate temporally 
homogeneous Markov chain on S x N, with transition kernel Q(s, •) only depending on the first 
component, viz., for all n > 0, s £ S and t > 0, 

P(M„+i = s,T n+1 < t\M n ,T n ) = Q(M n ,{s} x [0,t]) (10) 

holds. Put S n := Er=o T * for > and u(t) := max{ii > 0\S„ < t} (max0 := 0) for t > 0. 
Then Z n := M v i n \,n > 0, is called semi-Markov chain with embedded Markov chain (M„)„>o and 
sojourn or holding times Tq,Ti,.... 



Note that equation (10 1 holds iff M — (M„)„>o forms a temporally homogeneous Markov chain 
and the (T„)„>o are conditionally independent given M such that the distribution of T n only 
depends on M„_i, M n for n > 1 (in a temporally homogeneous manner), and on M for n = 0. 
Note further that we have specialized to the case where holding times take values in N only (instead 
of (0,oo)). 

Recall from ([8| the definition of s m for m G S^\N^ and notice that the second equality there 
entails E(z* (m, s m )) < E(z* (m, s) for any s € d + V^ (m)\{s m }. Further recall from our basic 
assumptions that p*(r,s) = lim^oo p(r, s) exists for all r, s £ S and is positive if E(r) > E(s). 
The following result, revealing the announced convergence for AAC, confirms in particular that a 
valley V(m), m £ S^\N^ % \ is a.a.s. to be left via s m . 

Proposition 3.4. For each 1 < i < n and as ft — > oo, the level i AAC converges to a Markov 
chain = (Yn ) n >o 011 with transition probabilities p(r, s) — Pi{r, s) stated below, that is 

lim V(Y(% = a |yW = rj yW = m n _ u ..., yW = m ) - p\r,s) 

p— >oo 

for all mo,...,m n _i,r,s € andn>0. We have p(r 7 •) :— S Sr if r € S^\N^ % \ and 

P(r,-) := * ( E P*(r,s)5 s + ^ ( E P^) ] 5.) , 

\«eJV(r)nJVW se5(')\Ar< 1 ) \r'eA/'(r)ny( I )(s) / / 

i/ r £ . 

yW = (y (l) ) >o is called the asymptotic jump chain at level i hereafter. Note that, typically, it 
is not irreducible. It may have transient states, not necessarily non-assigned, and its irreducibility 
classes are of the form {mi,..., rrik, s} for a collection mi,...,mfc £ S^\N^ and some s £ 
satisfying s = s mi = ... = s mk . 
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Proof: Fix 1 < i < n and write Y n for Y n ■ The first step is to verify that, as /3 — » oo, 

P(Y n+1 = s|Y„ = r,y„_! = m n _i r ..,F = m ) = P r (Y = s) + o(l) 

for all mo,..., m n _i, r,s£ 6>( 1 ' and n > 0. If r £ then Y„ = X„ n and the Markov property of 

X provide us with the even stronger result 

P(Y„ +1 = s\Y n = r,Y n -i = m n -i,...,Y = m ) - P r (Yj = s). 

A little more care is needed if r € S^\N^ l K For any s <E 5^, a; € F"(r) and n > 0, we have 

P(Y n+1 = s|Y„ = r,A CT „ =x) 

= P x (Fi = s, r r < o-i) + P*(Yi = s, r r > CTl ) 

= P r (Yi = s) P x (r r < + P s (Fj = a, T r > <t x ). 

The last two summands can further be bounded by 

P r (Yi = s)F x (r r < ax) < ¥ r (Yi = s) and ¥ x (Yx = s, r r > oi) < P^eri < r r ). 

For the last probability, Theorem |2.2| ensures 

P*(ai<r r ) < ^ Px(r 2 <r r ) < e&r,z,P) ^ 0. 

zS a+y(i)( r ) «68+VM(r) 

Consequently, as /3 — > oo, 

P r (n=s) = 1- e(x,r,z,0)\p r (Y 1 = s)+o(l) 

\ zed+V<-*1(r) J 

< (l-P x ( CTl <r r ))P r (Y 1 =s)+o(l) 

< P(Y n+1 = s\ X an = x, Y n = r) + o(l) 

<P r (Y= S ) + £ £ >,r,z,/?) + o(l) 

ze9+y(*)(r) 
= P r (Yi = s)+o(l), 

and therefore 

P(Y n+i = s\Y n = r. Yn-x = m„_i,..., Y = m ) 
= ^ P(Y n+1 = s|Y CT „ = x, Y n = r) F(X an = x\Y n = r, Y n -i = m n _i,..., Y = m ) 

= P r (Y = s) + 0(1). 



It remains to verify that P r (Yi = s) = p(r,s) + o(l) for any r,s £ <SW. If r £ S^\N^ l \ then 
cri = rjvt.) and Yi = I TMi) . Since £(z*(r,s r )) < £(z*(r,s)) for each s r ^ s e N® Ci d+V^{r), 
we now infer with the help of Theorem |2.1| 

P r (Y x ^ s r ) = P r (r s < r Sr for some s £ N^\{s r }) 

< P r( T »< T «J 

< e(r,s r ,s,P) 
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as P — > oo and thus P r (Yx € •) — > <S SV = p( r , ) as claimed. If r € A^, then either Yj = s G 
A/"(r) n JV"W, or Fi = s € S«\AW andX CTl = r' for some r' G Af(r) n F^(s). It thus follows that 

P r (*=.) = Wl = S ) = = -^- )+ o(l) 

1 — p[r, r) l—p(r,r) 

if s eAf(r)n AW, while 

p r (y 1=S )= ^ p r (x ffl =/)= x; /* (r ;r\ +°( i ) 

' 1 — »* (r, r) 

r'6A''(r)nV< i >(s) r' GJV(r)nV CO ( s ) v y 

in the second case. □ 

Having shown that Y^> behaves asymptotically as a Markov chain, viz. the jump chain it 
is fairly easy to verify with the help of the next simple lemma that the augmented bivariate AC 
(Y^\ ^"n+i)„>o ^ s as y m ptotically semi-Markovian. 

Lemma 3.5. For each j3 > 0, the sojourn times a n +i — a n , n>0, of the ACY are conditionally 
independent given Y^ . The conditional law of a n+ \ — o~ n depends only on (Y^-n Y n\ ^+i) ant ^ 
satisfies 



Geom(l-p(y,y)), if y E 

Es e Vi>Hy),s~M<?l e ■ K 1 = *)P x (X ffl = S ), i/j, £ A« 



Q((a;,y,z),-) := <j ^ m ,_ ,„(<) , v ' ^' _ Jir ffl ( n ) 



for all x,y,z£ with P(V„ ( ! ) 1 = x, V„ (i) = y, Y^ = z) > and n > 1. 

Proof: The assertions follow easily when observing that, on the one h and, at least one state y G A« 

must be visited between two states x,z G S^'\N^ 1 ' (Lemma 1.151 and that, on the other hand, 

(i) _ _ 

the original chain X and its aggregation Y coincide at any epoch where a non-assigned state is 

hit, which renders the Markov property of Y at these epochs. Further details are omitted. □ 

In order to formulate the next result, let = <7o < a\ < ... be an increasing sequence of 
random variables such that its increments a n +i — o n ,n > 0, are conditionally independent given 
the asymptotic jump chain Y^'. Moreover, let the conditional law of a n +i — v n depend only 
on (Y^,?^ and be equal to Q((Y^ l5 $£\ Y^), ■ ), with Q as defined in ((TTJ . Then 
((V„ (4) ,F„ ( ^ 1 ),o : n+ i)„>o forms a Markov renewal process and {Y^ n y *p(n+i)) n >o a semi-Markov 
chain, where v(n) := sup{fc > 0\dk < n}. 

Proposition 3.6. For each 1 < i < n, {(Yn\ Y^i), c n +i)n>o converges to the Markov renewal 
process ((Y^ % \ Y^lj), 5 : n+i)n>o in the sense that 

lim P ^(^ W ' F fe+i) = iV^Vk+i), o- k+1 = i k+1 , 0<k<n) _ 1 



t^°° F yo ((yW, y fe «) = (y fc) ttt+x), a fe+1 = i fe+1 , < fc < n) 

for all yo,...,y n+ i G , < i\ < ... < and n > suc/i i/iai i/ie denominator is positii 

Furthermore, (Y n , Y n+ i)n>o * s asymptotically semi-Markovian in the sense that 

lim M( F iV£iH(^+i)> < fc <") = j 

^ M(^2)^ ( (t. + i)) = (tf*,»*+i),0 < fe < n) 
/or a/Z ?/ovi 2/n+i € and n > such that the denominator is positive. 
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Proof: The first assertion being obvious by Proposition 3.4 note that it implies, with v(n) := 
sup{fc > 0|<Tfc < n}, 

lllll ^TTT ^7T7 = 1 

for all yor-jZ/n+i £ 5^ and n > such that the denominator is positive. Therefore the second 
assertion follows when finally noting that 



v(n) ~ 3 1 ct 3<™< ct j+i} — 



(0 



for each n > 0. □ 

So we have shown that, although aggregation generally entails the loss of the Markov property, 
here it leads back to processes of this kind (Markov or semi-Markov chains) in an asymptotic sense 
at low temperature regimes. 



3.2 Reciprocating Jumps 

As discussed to some extent in the Introduction, we want to find an aggregation level at which 
reciprocating jumps appear to be very unlikely so as to obtain a better picture of essential features 
of the observed process. To render precision to this informal statement requires to further specify 
the term "reciprocating jump" and to provide a measure of likelihood for its occurrence. It is useful 
to point out first that the original chain X exhibits two types of reciprocating jumps: 

Intra-valley jumps which occur between states inside a valley (starting in a minimum the 
process falls back to it many times before leaving the valley) . 

Inter- VALLEY JUMPS which occur between two valleys (typically, when the energy barrier between 
these valleys is much lower then the barrier to any other valley). 



E 




Figure 4: Illustration of intra-valley jumps (left panel) versus inter- valley jumps (right panel). 



Clearly, intra-valley jumps disappear by aggregating valleys into metastates, while inter-valley 
jumps may also be viewed as intra-valley jumps for higher order valleys and do occur when tran- 
sitions between any two of them are much more likely than those to other valleys in which case 
they should be aggregated into one valley. This motivates the following definition. 

Definition 3.7. We say the process (vi^neN exhibits reciprocating jumps of order e > if there 
exists a nonempty subset A £ SW\iVM with the following property: For each mi € A, there exists 
m-i £ A such that 




> £ 



for all m € S^\(N^ % ' U A). In other words, it is exponentially more likely to stay in A than to 
leave it (ignoring intermediate visits to non-assigned states). 
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In view of our principal goal to give a path-independent definition of MBs, we must point out 
that, by irreducibility, reciprocating jumps always occur with positive probability at any nontrivial 
level of aggregation and can therefore never be ruled out completely. This is in contrast to the path- 
dependent version by Heuer |12| in which the non-occurrence of reciprocating jumps appears to 



be the crucial requirement. As a consequence, Definition 3.7 provides an alternative, probabilistic 



and verifiable criterion for reciprocating jumps to be sufficiently unlikely in a chosen aggregation. 

The following proposition contains further information on which valleys are visited successively 
by providing the probabilities of making a transition from V^irn) to V^(m') for any m,m' € 
<SW\iVW. It is a direct consequence of the asymptotic results in the previous subsection, notably 
Proposition |3.4| 

Proposition 3.8. Let m G S^ Z '\N^' , s m G d + V^{m) be as defined in ^ (i.e., the state on the 
outer boundary ofV^(m) with minimal energy). Then 

1- mi iv xAl\t \\ -V \ SreAA(s m )nV( i )( m )P*( s m,?') 

lnnP m I( i eF , m = p(s m ,m) = \ ' y , 

while 

lim P m (A Cl G V {i) {m')) =p(s TO ,m') + V V p(s, n) • ... • p{r n -i, r n )p\r n , ml) 

n >! ri,...,r„eJV (i) 

for any other ml G SM\iVW. 

The reader should notice that, as p(s, r) = whenever E(s) < E(r), the last sum actually ranges 
only over those non-assigned r 1; ...,r„ with E(s m ) > E(r\) > ... > E(r n ) > E(m'). 

Proof: Let us first point out that ¥ r (X^ g G V^(m)) — o(l) as f3 — > oo for any r G N^ 1 ' such 
that E(r) < E(s m ). Namely, since the last property implies r g" d + V^\m), any path from r into 
yW(m) must traverse a state s G d + V^(m) with E(s) > E(s rn ) > E(r), whence the probability 
for such a path goes to zero as /3 — > oo. Noting further that V m (Y^ ^ s m ) — o(l) as f3 — >• oo by 



Proposition 3.4 we now infer (with £„ = ^) 



g F«(m)) = P Sm pQ g V«(m)) + o(l) 

= P 5m (F«=m)+ J2 fjfe^ r Pr(X fo €VW(m))+ (l) 
= p(s m ,m) +o(l). 

The expression for p(s m , m) in terms of the p*(s m ,r) may be read off directly from the formula 



given in Proposition 3.4 For m' =/= m, m! G S^\N^\ we obtain in a similar manner 

- p Sm (y« = m') + £ E p «»(*i W = ^i,...,^ = r^riji = ro') + (1) 

">1 ri ,...,r„6JV(') 

= p(s m ,mf) +y~] ^2 p(s m ,ri) ■ ... ■p(r n _ 1 ,r n )p(r n) m') + o(l), 

">! ri,...,r n eiVW 

the last line by another appeal to the afore-mentioned proposition. □ 

In essence, the previous result tells us that a valley V^ l \m!) is neighbored to V^(m), that 
is, reachable with positive probability by the asymptotic jump chain yW (and thus by Y^ % > at 
any temperature level 0) without intermediately hitting any other valley, iff there exists at least 
one (in terms of energies) decreasing path in from s m to m' . For any other such pair of 
valleys, connected by a path through states in N^> , the transition probability decreases to zero 
exponentially in /3. If this path can be chosen to be unimodal, here called uphill-downhill-path, 
this can be stated in a very precise way as the next result shows. 
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Lemma 3.9. Let m ,mi G SW\jVW be two distinct local minima for some < i < n. Suppose 
there exists a minimal path 7 = (70,..., 7^) from s mo to m\ not hitting any other valley but V^(mi) 
and such that J(7o,...,7fc) = E{z* {s,nii)) — E(s mo ). Then 

lim ilnP mo (X Cl eV«(mi)) - - (^*(m 0) mi)) - B(s mo )) . 

Without assuming the existence of 7 as stated, the result remains valid when replacing = with <. 

Note that /(7o,...,7fc) = £(z*(s mo ,rai)) — E(s mQ ) does indeed imply the already mentioned 
property that 

£( 7i ) > for 1 < t < j 

and £J(7j) < -B(7i-i) for j + 1 < i < k 

if 7j = 2*(s mo ,mi). We call such a path an uphill-downhill-path because it first straddles the 
energy barrier E(z* (s mo ,mi)) and then falls down to the local minimum mi. The existence of 
such a path can be found in most 2- or higher dimensional energy landscapes. 

Proof: With 7 as stated, a lower bound for F mo (X^ 1 € V^(mi)) is easily obtained as follows: 

= (1 + o(l)) e -^(^(2*(s m o.™i))-- E (^o))-7^|5| 
= (1 + O(l)) e -^( £ ( 2; *(™0:nH))-i?(s mo ))-7 3 /3|5|^ 



For an upper bound, which does not require the existence of a 7 as claimed, we decompose the 
rent into dis; 
This leads to 



event into disjoint sets depending on the number of visits N, say, to tuq between 1 and Co = Cn^- 



V m „(X ei G N = 0) = P mo (Ci = r vW(mi) < r ma ) 

and, for k > 1, 

,(A 6 G F (i) (mi), |{r mo < n < Col^n = ™o}| = fc - 1, r mo < Co) 
, (X 6l G V® (mi), JV = fc — 1) P mo (r mo < Co) 



- too V 

^ m I 
'mo ( 



s roo (X 6 g FW(mi),AT= 0)P mo (r mo < Co) 
*m (€i = T y(»)( mi ) < 7"too)^ > to, ( t tod < Co) fc - 



Consequently, 



P mo C% G ^ w (™i)) =^ P ™o(^6 e V^(mi),iV = fc) 

fc>0 

= ^Pm (Cl = r y(>)( mi ) < TVno) P OTo( r TOo < Co)* 
fc>0 

_ Ptoq (6 = r V(i)( mi ) < Tm ) 
P roo (Co<T mo ) 

By invoking Proposition |2.4| we infer 

Pm (6 = T 1/W( mi ) < T m ) EreV( i )(m 1 )" I '"'l)( T r < r mo) 



)(Co < T m ) Pmo( r x < T m ) 

< if e -0(E(z*(m o ,r))-E(z*(m o ,x))-7'Yi3) ^) 



reC('l(mi) 
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for all x G V^(m ) c , where 



K(J3) = \S\ \S\ exp -(3 min (E(a) - E{b)) + 2r/rf + 1 max \M{r)\. 

\ \ a^b:E(a)>E(b) J J r£S 

For any r G V^ l '(mi), we have E(z*(m,Q,r)) > _B(z*(m l7 r)) and therefore 

E(z* (mo, mi)) < E(z*(mo,r)) V E{z*{r, mi)) = £7(z* (mo, r)). 



Using this in (121, we obtain 

Pmofe } = TvW(mx) < T m ) < x .^| 5 | e _ i 8(£;(«*(mo,mi))-B( z *(mo,x))-r7p) 
mo (CO < T mo ) 

and then, upon choosing x = s ma and noting that E(z*(mo, s TOo )) = E(s mo ), 

P mo (A 5l e V W (mi)) < A'(/3) | l s| e -/5(s(^(™o,mi))-B( Smo )-7 7 ^) > 
By combining all previous results, we finally conclude 

lim ^lnP mo (X fl e K«(mi)) = -(^'K.mi))-^)) 

/3->oo p 

as asserted. □ 

To summarize, which valleys are visited consecutively depends on (a) their spatial arrangement 
and (b) the energy barriers between them: A transition from one valley V^(m ) to another 
valley vM(mi) is only possible, if there exists a path from s„ i0 to V^(mi), not hitting any 
other valley. This transition is made at small temperatures (i.e. large f3) if the additional en- 
ergy barrier E(z* (s m „ , mi)) — E(s mo ) is sufficiently small or in other words the energy barrier 
E(z* (s mo ,mi)) is approximately of the same height as all other energy barriers, including the 
barrier E(z*(s mo , m )) = E(s mo ). 

A result similar to the previous lemma holds true for transitions from m G S^\N^ 1 ' to any 
s G d+V(m). 

Lemma 3.10. Let m G S^\N^ and s G d+V {i \m). Then 

lim i lnP m (Vi = s) = -(E(s) - E(s m )). 

/3->oo fj 

Proof: For the proof, decompose again the event {Yi = s} with respect to the number of visits to 
m before V(m) = V'''(m) is left (or use Proposition 2.8|, giving 

V ^ P m (<7! < r m ) 

The proof of the lower bound is much more technical. Let 7 = (71 7„) G T* (m, s) be a minimal 
path which leaves V(m) only in the last step and such that for any 7^,7^ G 7 both, the subpath 
from 7^ to 7^, and the inversed path from jj to j{, are minimal. Define 

T{) := m and r\ :— 7i with iq := inf{0 < i < n — l|_E(7j_|_i) > E(s m )}. 

In particular, E{r\) < E(s m ) and E(z*{r\, Tq)) < E(s m ). Define furthermore the first record by 
si := 7^ with 

ii := inf {i < < < n | £(7;) > E(s m ), inf{j > i|£( 7j ) < £( 7l )} < inf{j > ^(7^ > £( 7i )}}, 
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and then successively for k > 1 with s k = ^ s the records Sk+i ■= li k+1 with 
h+i ■= inf 

inf{j > i\E{lj) < £?(7i)} < mf{j > i|£(7j) > £(7)}}- 

Note that the energy of these records is increasing. Let s k -i be the last record defined in this 
way and s k ■= s. Since E(z* (to, s)) = £7(s), is as well a record. Given the records S\,...,s k , for 
1 < i < k — 1 let r 2i be the first minimum along 7 after Sj and r 2 i+i the last minimum along 7 
before Sj+i. Here a minimum along 7 is some 7, e 7 such that it is a minimum of £7 restricted to 
7. Finally, let r 2 fc := s k = s. In the following we will proof that 

(a) F r2j (r r2j . +1 < Co) — > 1 as /? — > 00 for an Y < j < fc — 1, 

(b) P ri (r r2 < Co) > e-^ E(z '^' r ^- E ^+°W>, 

(c) Pr 23+1 (T r23+2 < Co) > e~^ E( - z ' ' r2 ^+ 2 - 1 ,r 2j ))+o(i)) for any 1 < j < - 2, 

(d) P r2fe 1 (r r2fc = Co) > e~ /3 ( £; ( z *( r2 ' c - 1 ' r2fe ^ _£; ( z *( r2 ' c - 3 ' r2fe - 2 ^ +0 ( 1 ^. 
This gives for (3 large enough 

(2fc-2 \ 
II P^(^ +1 < Co) J K 2k .Ar r2k = Co) 



> J !! P ^ + i(^ +2 < Co) Pr.^^T^ - Co) 



v.J= 

fe-1 



> J : e -/3(£( Z *(r 1 ,r 2 ))-E( Sm )+o(l)) . "Q £ -P(E(z' (r 2] + 1 ,r 2j+2 ))-E{z* (r 2j -i ,r 2j ))+o(l)) 



- 1 „-/3(£(z*(r 2fc - 1 ,r 2fc ))-,E(s m )+o(l)) 

2 

= e -«£( S )-E( Sm )+»(l)) i 

and thus the assertion. 

(a) For j = this is obvious since E(z* (r , ri)) < E(s m ). For 1 < j < fc— 1 and any r' e <9+F(m) 
it holds true that 

£(**(r 2j -,r 2j+ i)) < E{z*{r 2i ,r 2j ^)) < E(z*(r 2j ,m)) < E(z*(r 2j ,r')), (13) 

where we make use of the fact that between r 2j - and r 2 j + i the energy stays below the last record, 
and that all subpaths of 7 are minimal as well. 

(b) By the definition of n and r 2 , there is a unimodal path between them so that the cumulative 
activation energy along this path equals E(z*(r\, r 2 )) — E(n). Therefore 

P ri (r r2 < (CoAT ri )) > e -/3(^(-*(- 1 ,r 2 ))-E(r 1 )+o(l))_ 

Furthermore, for any r' G d + V(m) we have 

E(z*(r u r')) > E(s m ) >E(n) and E(z*(n, r 2 )) > E(s m ) > E(n). 

Therefore, 

P ri((Tr 2 A Co) < r ri ) < ^2 

Pri (jr' < T ri ) 

r'ed+V(m)u{r 2 } 

< ^ e -P(E(z'( ri y))-E( ri )+o(l)) 

r'ed+V(m)u{r 2 } 

< e -/3(B( Sm )-£;(ri)+o(l))_ 
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Combining the two estimates, we get 

(c) Let 1 < j < k — 2. We use the same strategy as in the proof of (b). So, again, by the 
definition of r 2 j+\ and ?"2j+2, there is a unimodal path between them with cumulative activation 
energy E(z*(r 2j +i, r 2j+2 )) - E(r 2 j+i) along this path, and 

F (t <T U~ n A t "I s ! > p-/ 3 ( E ( z *( r 2j + i^2j + 2))-E{r 2j + i)+o(l)) 

sr r2j + i\ ir 2 j + 2 ^ 1^0 '\ lr 2 j + i)) ^ e 

Furthermore, 

E(z*(r 2j -i,r 2 j)) < E(z*(r 2j+1 ,r 2:j+2 )) and S(r 2j+1 ) < E(z* (r 2j+1 ,r 2j+2 )). 



Finally, for any r' G 9 + y(m), by use of Equation (13 1 



E(r 2j+1 ) <E(z*(r 2j+1 ,r 2j )) 

< E(z*(r 2j ,r')) 

< E(z*(r 2j ,r 2j+1 ))WE(z*(r 2j+ll r')) 
= £(**(r 2i+1) r')). 



Thus, 



((Trsy+a A Co) < T T2] + 1 ) < 

^r 2 j+l i T r' < T r 2 j + i) 

r'£d+V(m)U{r 2j + 2} 

< ^ e -«E(2*( t i j+ i 1 r'))-B(r 8j+ i)+o(l)) 

r'ga+V(m)U(r 2i+ 2} 

< e -/3(S(r2i-i,r23)-E(r2 3 - + i)+o(l))_ 

Combining the two estimates, we get 

P ( T < f n ) - Pr ^+i( Tr ^+2 < (Co AT r2j + 1 )) /3 (E( z »(r 23+1 ,r^ +2 ))-iJ(r 2 j_ 1 ,r 2i )+o(l)) 

lr r2 j + i U T r 2j+2 A W ^ T r 2 j + iJ 

(d) All bounds for energies in (c) can be proved in the very same way (here r' G d + V(m)\{s}), 
so that 

p (V - - Pr 2 fc -l( r r-2 fc = (Co Ar r2t-l)) > -0(E(z-(r 2k -^r2 k ))-E(r 2k -3-r2k-2)+o(l)) 

□ 

Now we see for the reciprocating jumps in the accelerated chain: 

Proposition 3.11. Fix 1 < i < n and e > 0. TTien the AAC at level i exhibits no reciprocating 
jumps of order e if the following three conditions hold true: 

(1) E(z*(m ,m 1 )) - E(s mo ) < e for all distinct mo, mi G S^XN^K 

(2) For each m G S^\N^ , th ere exist at least two distinct m\,m 2 g S^\N^\m 7^ 171%, m 2 , 
such that P m (A ?1 G V^(mj)) > for j = 1,2. 

(3) For each pair too, mi G S^\N^' with ¥ mo (X^ 1 G V^ l '(mi)) > 0, there exists a minimal 
uphill-downhill-path from s mo to m\ not hitting any valley but V^(m\). 
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The origin of our endeavor to define aggregations with no reciprocating jumps of an order 
larger than a small e is to obtain an associated process with (almost) decorrelated increments (in 
Euclidean state space), for this and a proper centering causes the variance up to the n-th jump to 
grow with n instead of n 2 . This is known as diffusive behavior in physics. Without aggregation 
increments are highly correlated due to the following argument: at any given time, the process 
is with high probability in a minimum and when leaving it, say by making a positive jump, the 
next increment is most likely negative because there is a drift back to the minimum. Likewise, 
the increments of the asymptotic jump chain are neither uncorrelated nor having mean zero since 
trajectories of Y on an irreducibility class are almost surely of the form mi — > s — > 7712 — s- s —> 
m3 — > where s € iVW and mi,m2,... £ <SM\iVw. Thus, given the previous increments, it is in 
general easy to predict the next increment and they do not have mean zero. On the other hand, 
if we can choose /3 and i such that, for any mo, mi, we have P mo (Y 1 = mi) G {Pm Q ± e} U [0, e] 
for £ <C p m „, we obtain an AAC which behaves roughly like a RW on a graph. Such a RW is 
diffusive if we assume periodic boundary conditions (or sufficiently large state space compared 
to the observation time n) and an energy landscape E which is homogeneous enough to ensure 
zero-mean increments. In particular {m.|P mo (Y"i = m) = p mo ± e} has to comprise at least two 
states. 



3.3 Metabasins 

A path-independent definition of metabasins can now be given on the basis of the previous con- 
siderations. 

Definition 3.12. A finite Markov chain X driven by an energy function E satisfying the as- 
sumptions stated at the beginning of Section [T] has metabasins of order e > if there exists an 
aggregation level i < n— 1 such that the following conditions are fulfilled for each m G <SW\JVW; 

(MB1) sup m , eS( o\( iV (« u{m }) E(z*(m, m')) - E(s m ) < e. 

(MB2) There are at least two distinct mi, m2 € S^\(N^ U {m}) with a minimal uphill-downhill- 
path from s rn to rrik not hitting any other valley but V^(mfc) for k = 1, 2. 

In this case, the valleys {V^ (m)) meS a) are called metabasins (MB) of order e. 

The reader should notice that each singleton set {s} consisting of a non-assigned state s € A/"*- 1 ' 
forms a MB. The conditions (MB1) and (MB2) ensure the good nature of (a) the energy barriers 
and (b) the spatial arrangement of minima. As already pointed out, this determines which valleys 
are visited consecutively. Properties of MB which can be concluded from the results of the previous 
sections are summarized in the next theorem. The reader is reminded of Properties 1-4 stated in 
the Introduction. 

Theorem 3.13. For MB as defined in Definition \3.12\ we have 

(1) The transition probabilities for jumps between MB do not depend on the point of entrance as 
j3 — > 00 (Property 1). 

(2) There are no reciprocating jumps of order e (Property 2). 

(3) The expected residence time in a MB depends on E only via the depth of the MB as (3 — > 00 
(Property 3). 

(4) Regarding only MB pertaining to local minima, the system is a trap model (Property 4). 

Proof: (1) follows from Proposition |3.4| (2) from Proposition 3 . 1 1 1 (3) from Theorem 2.12| and (4) 
directly from the definition. □ 



34 



3 MACROSCOPIC VIEW: TRANSITIONS BETWEEN VALLEYS 




Figure 5: Example of an energy landscape with a tree-like structure. 



It should not be surprising that the path-dependent definition of MB by Heuer [12| and stated 
in the Introduction differs from our path-independent one. For example, the energy landscape in 
Figure [5] has no reasonable path-dependent MB because every transition between two branches of 
the shown tree must pass through the state x. For a typical trajectory, there will be at most three 
MB: the states visited before the first occurrence of x, the states visited between the first and the 
last occurrence of x, and the states visited after the last occurrence of x. The reason for this poor 
performance is the tree- like structure of the energy landscape or, more generally, the fact that the 
connectivity between the branches is too small to allow a selfavoiding walk through more than two 
branches. This results in a small recurrence time for x (compared to the number of states visited 
in between). However, every branch constitutes a MB when using the path-independent definition 
for sufficiently small e, in which case the AAC forms a Markov chain and, given the Metropolis 
algorithm, even a RW on the graph. 

Having thus exemplified that the two definitions of MB do not necessarily coincide, where the 
path-independent approach applies to a wider class of energy landscapes, we turn to the question 
about conditions for them to coincide with high probability. As already pointed out, we have to 
assume a sufficient connectivity between the metastates to ensure the existence of reasonable path- 



dependent MB. In terms of this connectivity (for a precise definition see Definition 3.14) and the 
parameter j3 and e, our last result, Theorem |3 . 1 7| below , provides lower bounds for the probability 
that both definitions yield the same partition of the state space. 

The first step towards this end is to identify and count, for each m G SW and a given /?, the 
states s G S' 1 ' for which a transition of Y from m to s is likely. This leads to the announced 
connectivity parameter. 

Definition 3.14. Let e > and suppose that X has MB of order e > at level i. Define the 

connectivity parameters 



{seAf(n)\V {i) (r)\E(s)<E(n)+e\ , 
{s € d+V^(m)\E{s) < E(s m )+e}\, (14) 



r]i = r]x s := mm 
m = m,e ■= min 

m6S(')\jV(-) 

V3 = V3e ■= min \{s € S^\E(x) < E(n) +e for some x E V^\s) DAf(n)}\. 

r/i is the minimal number of neighboring sites of a non-assigned state n which do not belong to a 
particular neighboring valley and whose energy is at most e plus the energy of n. 772 is the minimal 
number of neighboring sites/valleys of a valley V^(m) whose energy is at most e plus the energy 
of s m and which can be reached via an uphill-path from m. Finally, 773 is the minimal number of 
neighboring valleys of a non-assigned state n which comprise a state with energy of at most e plus 
the energy of n. 771 and 773 are always at least 2 and in fact quite large in the very complex energy 
landscapes of structural glasses. For very small e, 772 may be 1, but if X has MB of order e in a 
high dimensional energy landscape, then 772 can be assumed to be quite large as well. 

That transitions to states counted above have reasonable large probabilities is content of the 
following lemma. Thus, the defined parameters do in fact measure the connectivity of the MB. 
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Lemma 3.15. Let e > and suppose that X has MB of order e > at level i with connectivity 
parameters defined in ( 14 1 . Writing Yk for Yu and V(m) for 7®(m), m £ S^ l \ the following 
assertions hold true for all sufficiently large (3: 

(a) Ifm £ 5W\JVW andse d + V(m)n{x\E(x) -E(s m ) < e}, or m £ N® and s £ satisfies 
V{s) n{i£ N{m)\E{x) - E{m) < e} ^ 0, then 

P m (Y!=s) > e~ 2pe . 

(b) For any distinct m £ N^' and s £ S^ 1 ' , 

P m (n^«) > lie"*, 

(c) For any distinct m £ S^\N^ and s £ S (l \ 

We see that, for e small enough compared to /?, transitions with an energy barrier of at most 
£ are still quite likely and thus a jump to a particular valley quite unlikely in the case of high 
connectivity. 

Proof: (a) Choose /3 > so large that, for /3 > /3 , j < e and P m (F a = s) > e -W{E(s)-E(s m )) 



for any m £ S^\N^> and s £ d + V(m), the latter being possible by Lemma 3.10 Then for any 
such m and s, we infer P m (Yi = s) > e~ 2e P provided that additionally E(s) < E(s m ) + e holds 
true. If m £ N®, then W m {Y x = s) > er 2e(i for any s £ such that E(x) < E(s) + e for some 
x £ V(s) n Af(m), for 

P™(*i = s) > P m (X ai =x) = p(m,x) > e -^((s(x)-E(m))++ 7 ,)_ 

(b) Pick again /3q so large that 7/3 < £ for all j3 > /?o- Then, 

xeN \m) ,x£V (s) 

> £ exp (-/?((£(*) -£(m))++ 7/?)) 

xeJV(ro),x£V(s),-E(a:)<.E(ro)+E 

> 771 exp(-2/3e), 

by definition of 771 . 

(c) Fix /3 so large that P m (y x = x) > e -2/3(B(x)-B( Sm )) for any x £ a+^( 771 ) and (3 > /3 . In 
the very same way as in part (b) , we then get for all j3 > /?o 

P m (Yi^s) > Yl P m {Yx=x) 

> J2 ex P ( " 2 ^ E ( X ) ~ E{s m ))) 

xed+V{m),x^s,E(x)<E(s m )+e 

> { V2 - 1) exp{-2/3s), 

by definition of 772. □ 

The above result motivates that in the case of high connectivity the probability to revisit a 
particular valley within a fixed time T is quite small, or in other words, the probability for the 
AAC to jump along a selfavoiding path is quite high. This is the main step towards the announced 
theorem and stated below. The observation time T of course has to be small compared to the 
cover time of the process. 
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Lemma 3.16. Let e > and suppose that X has MB of order e > at level i with connectivity 



parameters defined in (14). Writing for Y^ and V(m) for V^(m), m G define 



4\ m) : =inf{fc>l|y fe =m}. 
Then the following assertions hold true for all sufficiently large f3: 
(a) Forany0<5<l- ¥ m (Y 2 = m) andl<T < T(m, j3) + 1, 

where 



ln(min m ^ m P m /(Yi ^ m)(l - l{ m 'gjvM} s ( m '>P))) 
and 

6(m',f3) : = max > e(x,m' \z, 0). 

xeV(m') 

z<£d+V(m') 

In particular, if 6 < ((r/i A (r/2 — 2))e~ 2,3e ) T for some T > 0, then T(m, f3) > T. 
(b) For each k > 1 and mo G 5^, 

fc-l 

mi,...,m fc j=0 

where summation ranges over all pairwise distinct mi,..., mj. € 5"\{mo} and for N G N we 
write [N] k := N(N - 1) • ... • (N - k + 1). 

It should be noticed that PmfjyL) > 1) = 1 (the AAC never stays put) and 

P ™(4m > T ) < p ™(^( m) > 2) = 1 - P m (F 2 (i) = m) 

for every T > 2 with equality holding only if T = 2. We thus see that P m ( T |/( m ) > > 5 entails 
S < 1 — P m (V2 = ™), the latter being typically large. Furthermore, the bound on the number of 
self-avoiding path of length k is very crude and can be improved when knowing more about the 
spatial arrangement of the metastable states. 



Proof: (a) Recall from the first part of the proof of Proposition 3.4 that 



,{Y n+l £ z\Y n = y, X an = x) > P tf (y x ^ z) 1 - l { yt N (i) } J2 V ^ Tr < T v 

\ r£d+V(y) 

> ¥ y (Y 1 z) 1 - l {ytfTWy ^V^P) 

\ r£d+V(y) 

> F y (Y 1 ^z)(l-l {ytlfW} S(y,0)) 



holds true for all y,z G 5^, x G V(y) and /3 > 0. This will now be used repeatedly to show that 



■{4\m) > T ) > ( P m ,(Yi ^ m)(l - l {m ,^ W} <5(m', /?))J 



T-l 
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for each T > 2. Putting m{x) := m! if x G V{m!) for m! £ we obtain 

r (i) > T 

= P m (Yi ?m,...,Y T ?m) 

T-2 

= W>m(X (Tl =X 1 )Y[Wm{X (Tk+1 =X k+1 \X ak =Xk,Yk=m(Xk)) 

ai,...,xx-i^V(*)(m) fe=l 

x P m (V T ^ m|A CTT _ 1 = xt-i,Y t _i = m(a: r _ 1 )) 

T-2 

> p m(^i = x i) n v m(X ak+1 = x k+1 \X <Jk = x k ,Y k =m(x k )) 

x 1 ,...,x T _ 1 ^V l . i '> (m) fc=l 

x (min (P m .(y^m)(l-l |ra ,^„ } V^))) 



> min (p m ,(Yi ^ m)(l - l {m '*wM} ^(m',/3))) T " ' 



But this establishes the asserted inequality when finally observing that the last expression is > 5 
iff T < T(m,/3) + 1. 

Having just said that T(m, (3) >T holds iff 

min (v m ,(Y 1 ^m)(l-l {m ,^ W}( 5(m',/3))) T > 5, 
it suffices to note that, as — > oo, 5(m',/3) -> holds true if m' G S«\7VW, giving 

l-5(m',/3) > ^ 
% - 1 

for sufficiently large /3. Together with Lemma 3.15| fo), this further yields 



min (P m ,(^i^m)(l-I {m ^ jv(l)}( 5(m',/3))) T > (( m A ( m - 2)) e - 2/3e ) T 
and then the assertion. 

(c) Here it suffices to notice that, by (a), [772 A 773 forms a lower bound for the number of 
self-avoiding paths (m ,...,m k ) such that ¥ mj (Y 1 = m,j + \) > e~ 2/3e for each j — 0,..., k — 1. □ 

We proceed with the announced result about the relation between path-dependent and path- 
independent MB. To this end, we fix T — ax for some K 6 N. Let V k for fc = l,...,v denote the 
random number of MB obtained from -Xq,..., X? as defined in the Introduction. For 1 e 5, we 
further let V(x) denote the MB V k containing x and put V(x) := if no such MB exists which is 
the case iff x ^ {Xq,..., Xt}- 

Theorem 3.17. Let e > and suppose that X has MB of order e > at level i with connectivity 
parameters defined in (141. Fix K G N, T = cr^- and < 8 < ((rji A (772 — 1) — l)e~ 2 ^ e ) . Then, 



for each < k < K and too G S^- l \ there exists /3q > swc/i i/iai for all /3 > /?o 

(V Pmo^V^Yfc) C V(Yfc)) > 1 - (max m£S( ,) W (,) |V<(m)| + 2) max meS( o W (o <5(m, /3), iota 

^ l) (s) :={s} ifseN®. 
(b) P mo (V(^-) C < 3 < k) > 1 - fc(max me5(l ) W(l) «5(m,/3) + (1 - 5)). 
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(c) IfV2 A 773 > K — 1) then 

P mo (V(y}) cyW(^), 0<j<K-l) > [t]2 A m ] K ( 1 - max <5(m,/3)) e" 2 ^. 

\ m£5W\iVl') / 

For the occurring bounds to be significant, two requirements must be met. First, K must 
be small compared to the cover time of the AAC and e must be small compared to /3q to ensure 
exp(— 2f3e) ^ 0. Second, the connectivity must be high to ensure 1 — S <C 1 and [r/2 /\f]3]Ke~ 2K£ ^ ^ 
0. 

Typically, the inclusions in parts (b) and (c) are strict because of high energy states within a 
valley that will probably be missed during one simulation run and therefore not belong to any 
path-dependent MB. On the other hand, since our approach strives to cover the state space as 
completely as possible by valleys the latter comprise such high energy states whenever they are 
assignable in the sense described in Section [T] 

Proof: With i being fixed, let us write as earlier V(m) for V^(m), and also V<(m) for V<\m). 
(a) For a given < k < K, define 



Bk 
C k 



= {o-fc < Ty k < CT k+ i}, 

= { for every x £ V < (m) exists Ty k < l x < o~k such that Xi ~ x}, 

= {Xi = Yk for some max t x < I < end. 

xev<(Y k ) 



With (5(m, f3) as defined in Lemma |3. 16 and using 

Px(<7i<r m ) < y £(x,m,y,i3) < max 5(m,f3) =: <5 max (15) 

/-f meSW\JVW 
y£a + V(m) 

for x S V(m), m £ S^\N^ l \ we obtain 

> ¥ mo (A k nB k nc k ) 

Y p ™« =r}nA k nB k nc k ) 

Pmo {X ak = r)P mo (A k \X ak = r)P mo (B k n C fe |{A CT(c = r} n A k ) 

mgS(') ,rGV(m) 

]T P roo (X CTfc = r)P r (r m < a!) 

meS('>,r£y(m) 

x Pn^Tj; < di for every x € V < (m), Xi — m for some max r x < I < ai) 

i£V<H 

> J] P mo (X ffJ = r)(l-<5 max ) 

me5< i ),r'ey(m) 

x P m (r 2 . < (T! for every x € V < (m), Xi=m for some max t x < I < o~i) 



(l-^max) P mo (**="*) 



mGSW 

x P m (r 2; < (Ti for every a; € 1/<(to), A; = m for some max t x < I < o~\). (16) 

x£V < (m) 

Thus, in order to show that with high probability a path-dependent MB comprises the inner part 
of a valley, we show that with high probability, when starting in its minimum, the whole inner 
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part will be visited and the process will return to the minimum once more before the valley is left. 
This is trivial if m G iVW and thus V<\m) = {m}, for then 

P m (T|, < <j\ for every x G V<(m), Xi — to for some max r x < I < a±) = 1. 

x£V < (m) 

More needs to be done if m G S^\N^ l \ where 

< ci for every i£ V< (m) , Xi = to for some max t x < I < a\ 
> 1 — P m (t x > a i for some x € V < (m)) — P m (A; m for each max t x < I < aA 

x£V < (m) 



The second probability in the preceding line can further be bounded with the help of (15 1, viz 



i(Xi m for each max t x <l<<7\) 

iev<(m) 

= > P m ( max t x = t v , Xi ^ m for every max t x < I < <j\) 
f-^ xev<(m) xev<(m) 

S/6V<(m) 

< P m ( max r x = r t/ )Pj / (r m > o-i) 

— ^ a3GV<(m) 
y£V < (m) 

< 6- 



max • 



while for the first probability, we obtain with the help of Theorem 2.1 



Pm (t x > ci for some x G V<(m)) < P m (ffi < t^) 

iE^ (m) y6<9+ V(m) 

< E E <rn,x,y,^), (17) 

iEV< (m) t/£9+ V(m) 

because E(z* (m,y)) > E(z* (m,x)) for a; G V < (m) and y £ <9 + T^(to). The latter can be seen as 



follows: It has been shown in the proof of Theorem 2.2 that E(z*(x,m)) < E(z*(x,y)). Hence, 

E{z*[x,m)) < E(z*(x,y)) < E(z*(x,m)) V E(z*(y,m)) = E(z*(y,m)) 
as asserted. Next, we infer 

E(z*{x,y)) <E(z*(x,m))W E(z*(m,y)) < E(z*(x,m)) V E{z*(x,y)) = E(z*(x,y)), 
thus E(z*(x,y)) = E(z* (m,y)). Recalling the definition of e(m, x,y, (3), the last equality implies 



e(m, x, y, /?) = e(x, to, y, ft) which will now be used to further bound the expression in (17), namely 



x£V < (m) y£d+V(m) xeV < {m) y£d+V (m) 

= 22 22 e{ x ' m >v>P) 

max 

< max |V<(m)| <5 max . 
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Together with ( 16 1 this yields as asserted 



,(#»)cv(n) 



> (1-<W> X Pm (Y k =m) 



x P m (Tj; < c i for every a; € V<(m), X[ = m for some max < Z < a\) 

x£V < (m) 



> (1-^max) 1- max |7<(m)| + l <y n 



> 1- max |V<(m)| + 2 U n 



(b) According to Lemma 3.16| choose /3 > such that 

(r$ m) < ^) < 1 - S (18) 



max . 

meSW 



for each (3 > /? . By using (18 1 and (151, we now infer 
P, mo [Yi = Yfc for some k + 1 < I < K) 

= X P "io ( Y i = Y k for some k + 1 < I < K, X„ h = s) 

< J! P ™o (-Xa* = «) ( p s ( T V(Y ) 

< K — k, Xj = m(s) for some < j < a%) 

+ 1{ S ^W} ^(4\V ) <K-k,X^ m(s) for all < j < 

< ^P mo (I fft = s)F m(s) (4l Yo) <K-k)+ X V n l0 (X ak = s)F s (a 1 < r m(s) ) 

s€S sgiVW 

< 1-8 + <5 max , 

and finally 

(fc-i 
fl {Yi^Y^j + l<l<K} 
3=0 
fc-1 

> 1 - V V mo (Yi = Yj for some j + 1 < I < K) 

3=0 

> l-k(6 max + (l-6)). 

(c) In the following calculation, let ro = tuq, ^2 m . range over all if- vectors (mi,...,rajf) with 

pairwise distinct components in 6>W\{mo} and, for each k < K, let Y^n r k ran g e over k- 
vectors (ri,...,rfe) such that rj G ^(m.,-) for each j = 1,..., fc. As in part (b), use ( 15 1 repeatedly to 
infer 

P^(V(y i )cV«(Yi),0<i<A--l) 

- X! X! Pm ° ( fl ft = m i' = r i> < °J+0 n ft^ = m *> 

rrij n,...,rK-l \j=0 
X-2 

= X X II Pr J ( Y ° = m 3> X °i = r i+l' T TOj < a l) P ^K-l ( T m K -l < °l) PmK-iC^l = m A') 
rrij ri,...,rjt_i j=0 

A' -2 

> (i - w) xx n p ^ (y° = m j> x °-i = r i+i> < (^i = m A-) 

t?Xj n,...,rjc-i i=0 
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11 13 15 s 




Figure 6: (a) 2-dimensional modification of the energy landscape from Example 1.2 

(b) sup m , eS (i)\ N (i) \E(z*(m, m!)) — E(s m ) \ for the various metastable states in S^\N^ % 
in dependence of the level 1 < i < tl. 



K-3 



mj ri,...,r A _ 2 i= 



K-3 

= (1 - <5max) X! X! IT Pr J ( Y ° = m h X cri = r J + l' T ™j < 
mj ri,...,fx-2 j— 

x P rA -_ 2 (Yi = mjf-i, r mff _ 2 < (Ti) P mK _ 1 (Yi = m K ) 

K-3 

> (1 - 5 max ) X! X! II Pr J = m h X <n = r i+l' T '»j < 

rrij ri,...,r*r-2 2=0 

x P m A ._ 2 (Yi = TOi<r_l) Pr A _ 2 (T mA _ 2 < CTl) P mK -l (^1 = 

> (i - o 2 n p ^ ( r ° = m ^ = r J+!' r »b' < 



nij n,...,rjf-2 j=0 



x F m A -2 (*1 = "lA'-l) Pm K _i(Yi = m K ) 



K-l 



> (1 - ^ma*)*" 1 X II P ^ ( Fl = 

> (l-<5max) K - 1 feA % ]Ke- 2Ke/3 , 

the last line following from Lemma [3.16| 



□ 



Example 3.18. We return to Example |1.2| given in Section [T] but modify the energy land- 
scape by allowing direct transitions between some saddles (see Figure [6] (a)) because (MB2) can 
clearly not be fulfilled in a one-dimensional model. While having no effect on the metastable 
states m £ valleys change in the way that, for levels i G {5,6}, the states {1,2,3} do no 

longer belong to the valley around state 4 and {1, 2} forms its own valley. The energy-differences 
sup TO / gS (i)\ n m E(z*(m, m')) — E(s m ) of the various metastable states m at each level 1 < i < n = 7 
are shown in Figure [6] (b) . The supremum of these energy differences decreases in i, and we obtain 
MB of order 1 for i > 4, and of order 0.5 for i > 6. 

To illustrate the behavior, we have run a Metropolis Algorithm on this energy landscape. For 
initial state s — 4 and f3 — 0.75, the energies of the trajectories of the original chain as well of the 
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aggregated chain at levels i = 3, 4, 5 are shown in Figure [7] The following observations are worth 
to be pointed out: 

• The number of reciprocating jumps decreases with increasing level of aggregation. 

• The deeper the valley, the longer the residence time. 

• The motion in state space is well described by the aggregated process. 

• Due to the very small size of the state space and a long simulation time, valleys are revisited. 




i=4 



T" 



T" 



T" 



T" 



100 200 300 400 500 



i=5 



1 1 1 1 1 r 

100 200 300 400 500 



Figure 7: Energies of the true trajectory and of the trajectories of the aggregated chain at levels 
i = 3,4,5. 
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